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Abstract. This document presents ^Annotated English^, a system of 
diacritical symbols which turns English pronunciation into a precise and 
unambiguous process. The annotations are defined and located in such a 
way that the original English text is not altered (not even a letter), thus 
allowing for a consistent reading and learning of the English language 
with and without annotations. The annotations are based on a set of 
general rules that makes the frequency of annotations not dramatically 
high (despite the chaotic orthography of English). This makes the reader 
easily associate annotations with exceptions, and makes it possible to 
shape, internalise and consolidate some rules for the English language 
which otherwise are weakened by the enormous amount of exceptions in 
English pronunciation. 

The advantages of this annotation system are manifold. Any existing text 
can be annotated without a significant increase in size. This means that 
we can get an annotated version of any document or book with the same 
number of pages and fontsize. Since no letter is affected, the text can 
be perfectly read by a person who does not know the annotation rules, 
since annotations can be simply ignored. The annotations are based on 
a set of rules which can be progressively learned and recognised, even 
in cases where the reader has no access or time to read the rules. This 
means that a reader can understand most of the annotations after reading 
a few pages of '"Annotated English^, and can take advantage from that 
knowledge for any other annotated document she may read in the future. 
These features pave the way for multiple applications. ^Annotated English^ 
can be used as a tool for teachers and parents when English-speaking 
children learn English orthography or simply when they start reading. 
Annotated textbooks, tales, dictionaries and any other material can make 
English orthography less painful. '"Annotated English"' can also be very 
useful for students of English as a foreign language, since pronunciation 
is terribly troublesome for them, because they need to learn the mean- 
ing, the spelling and the pronunciation for each word they learn, and the 
pronunciation does not improve at all by reading. This is so because we 
can only see the spelling and infer the meaning, but books do not provide 
the correct pronunciation for each word. In fact, incorrect pronunciation 
(we are not referring here to a bad accent or intonation) persists in peo- 
ple who has lived in an English-speaking country for decades, because 
the true pronunciation is never shown (unless the IPA transcription is 
looked up in a dictionary), just heard in different contexts and situations. 
Finally, ^Annotated English^ can also be very practical for any regular 



user (native or not) of the English language, very especially when fac- 
ing new (especially technical) words and they have doubts about their 
pronunciation. 

In this document we introduce the rules for annotations and its symbols. 

This document is not intended for a general audience, and the expla- 
nations arc focussed to precisely understand how the annotation system 
works. It is obvious that if the system has to be explained to a final 
user, a shorter and simpler manual should be issued for that. In any 
case, for learning '"Annotated English'', the best thing is to take a look at 
the examples. There is a section in this document with some annotated 
texts. It is recommended to take a quick look at them before getting into 
details. 

Keywords: English spelling, English pronunciation. Phonetic rules. Di- 
acritics, Pronunciation without Respelling, Spelling Reform. 

1 Introduction 

English is the lingua franca of the world. This means that it is used by many 
people from different countries and native languages. This also means that it 
is used with many different proficiency levels. A proper coordination between 
written and spoken English is only achieved by the few. The main reason for 
this is the deficient spelling system of the English language. We are not going to 
discuss on the many causes (the Great Vowel Shift, the lack of an Academy of the 
English Language acting as regulator, the nature of English as an amalgamation 
of words from many different languages). The issue is that it is a fact that English 
spelling is problematic. But it is also a fact that any spelling reform (shallow or 
deep) has been stubbornly doomed to failure. 

Spanning from several centuries ago, reforms can be distinguished into those 
which focus on spelling and those which focus on pronunciation. Many languages 
in the world based on the Roman alphabet produce one single pronunciation from 
a given spelling (Spanish, German, Italian, and French, are examples of this) , but 
almost no language produce one single spelling given a pronunciation (Italian 
is perhaps the closest one to that goal among the major European languages). 
Additionally, there have been minor reform proposals (as those undertook by 
American English in the nineteenth century), major ones (Webster proposed to 
go further beyond) and drastic reforms. There are also several organisations for 
English spelling reform^. 

If we take a look at some of the most recent proposals, we can get an idea 
of why they failed. For instance, Basic Roman Spelling [2] preserves the alpha- 
bet {''Luk thru dha senta av dha muun hwen it iz bluu" "look through the 
center of the moon when it is blue"), while it oversimplifies pronunciation, in 
order to supposedly give one-to-one correspondence between spelling and pro- 
nunciation. Similarly, Interspel [10] preserves the alphabet but it is closer to 

^ One of them is the spelling society: http://www.spellingsociety.org/aboutsss/position.php, 
which does not currently advocate any specific solution, but supports any study 
and progress on this issue. 
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the original English spelling: "English will luse its position as the wurld's lingua 
franca, which it inherited for politicl and comercial reasns. The rest of the wurld 
has problems lerning two English languages, the spoken and the written, when it 
shud he posibl to lem the one from the other''' . The one-to-one correspondence 
is lost, but pronunciation can be better determined (but not unambiguously). 
The first feeling after reading this is that a lot of words should be corrected. We 
have devoted a lot of effort to learn the correct spelling of each word in order 
to mangk; everything again. Other approaches try to avoid this confusion by 
using extended languages, such as John Malone's Unifon [4] , but going out from 
the Latin alphabet is a strong bet. Additionally, many of these reforms do not 
produce good pronunciations in some cases. For instance, most of them ignore or 
give a secondary role to stress, which is one of the main issues for a good pronun- 
ciation of English vowels, and to distinguish between many pairs (verb/noun), 
such as 'a present' and 'to present' (which are still different because of the vowel 
sounds), as well as 'an imprint' and 'to imprint' (which have exactly the same 
vowel pronunciation, only differing on the stress). 

There arc; sciveral problems about these proposals. They try to change the 
langTiagc spelling. This requires a broad consensus and an academy or work- 
ing group to undertake them. Even in languages where minor cliangcis have been 
made (e.g. Portuguese and German undertook minor reforms in the last decades), 
this has been difficult and has required almost a generation to settle. Any re- 
form involves that old texts become obsolete and a lot of retraining is required, 
as well as reprinting textbooks, documents and almost every written thing. But 
even ignoring all these major problems, there are many others, such as the loss 
of word etymology. This loss of word origin is not a picky thing for linguists; 
most people would not care about 'solid' being spelled as 'sollid', even if it is 
not etymological, because it comes from Latin 'solidus'. The problem is that any 
reform which would not preserve some basic etymology would make the learning 
of some words difficult to native speakers and especially foreigners (e.g. 'fone' 
is more difficult to recognise than 'phone', 'Rusha' than 'Russia', 'Krischan' 
than 'Christian', 'memmcri' than 'memory', 'langwidgc' than 'language', 'Ar- 
keeopteriks' than 'Archaeopteryx', 'owshan' than 'ocean'). This would obscure 
the relation among words with the same root. For instance, 'sine' (for 'sign') 
would make its relation with 'signal' much less evident. In addition, thousands 
of homographs ('to , 'two' and 'too'; 'male' and 'mail'; 'eye', T; 'by', 'buy' and 
'bye'; etc) would be created. Another problem would be the incorporation of 
new technical (e.g. biological and chemical) words. For instance, 'Arkceoptcriks' 
and 'Akwa' (for 'Aqua') would be very awkward. And, finally, one above all: we 
like the language as it is and we want to learn and work with one language, not 
with two languages (the old and the new). And finally, any spelling reform would 
have a dramatic impact in computational linguistics and thousands of computer 
applications which process English automatically, such as editors, spell check- 
ers, searching engines, etc. All these reasons arc much more important that the 
reluctance of users, the lack of consensus, the non-existence of an Academy of 
the English Language, or the Great Vowel Shift. English spelling reforms have 
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not succeeded simply because it is not true that everything would be beneficial 
once the transformation would be completed. 

So, is there any way to systematically know how a word in English is pro- 
nounced without a spelling reform? Yes, of course. This is called the phonetic 
transcription of a word. For instance, we can look up any word in the dictionary 
and SQC its IPA (International Phonetic Alphabet) transcription, where it says 
that 'explanation' is pronounced [,eksplo'ncij9n]. Therefore, the English pronun- 
ciation problem seems to be solved^. However, when we learn the language or 
we read a book in English, we do not check the phonetics for every word. Com- 
puter technology makes it possible to add an extra line to each line of text with 
the IPA transcription. However, this would double book sizes and would make 
readers switch repeatedly from English to IPA and vice-versa, as having a book 
in two languages or with subtitles. 

Instead of that, our proposal is to keep English unaltered, but to annotate its 
pronunciation over the very word, not apart from it. We can, of course, find some 
precedents. For instance, many editions of the Webster's Elementary Spelling- 
book (e.g. the 1836's edition [9]) contained a set of diacriticals which helped to 
learn the pronunciation of each word. The words were not respelled. Instead, 
"marked letters" were used, as in 'son' or 'get'. Some other dictionaries also 
used this philosophy until the mid XXth century, when respelling became more 
popular (and the pronunciation of 'son' was explained by respelling it like 'sun', 
and 'pleasure' was respelled as 'plesher'). Of course, things changed dramatically 
when the IPA popularised, and even though the respelled pronunciation is still 
common in dictionaries (most especially in America) , the idea of pronunciation 
without respelling (the "marked letters') faded away. Apart from (or out of) En- 
glish language textbooks and dictionaries, none of these systems have been used 
profusely, because the systems have not been designed to mark (or annotate) a 
whole text, but to clarify the pronunciation of some difficult words. In fact, it is 
still seen in some editions of the Bible, since most of their readers have difficulties 
on the pronunciation (and meaning) of words which are not frequent nowadays. 
There are also some children books which colour some letters to show their pro- 
nunciation, based on the ambitious (and once popular) respelling system known 
as the Initial Teaching Alphabet [3]. 

However, these approaches (including the ones used in old dictionaries) were 
incomplete (not all possible sounds were covered) and unsystematic (not in- 
tended to be used for every word) in such a way that an entire document can be 
annotated. The reason-why is that these approaches had almost no by-default 
rules, and using them for a whole document would have rendered it full of anno- 
tations, at a ratio of many annotations per word (something similar to what is 
shown in Table 1 (left)). For instance, there were marks for words such as 'cat' 
and 'cell', even though they look pretty regular to any English speaker. Addition- 
ally, they were not considering a possible automation of the process, since these 

^ In fact, we could write IPA instead (the most drastic and accurate spelling reform 
one can imagine), and we would have a one-to-one correspondence between spelling 
and pronunciation. 
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diacritical systems were conceived before computers spread over. And, finally, 
they were created at a moment when English phonology was not so well studied 
as it is today, and wore typically focusscd on one single dialect of English. 

Our proposal is not just a modernisation of these early approaches to pro- 
nunciation without respelling. It is different in philosophy and basic principles. 
The basic idea is to develop a set of symbols (marks or annotations, whatever 
we want to call them), but also a set of rules, in such a way that we can anno- 
tate some words which do not follow the rules, but we keep the parts that are 
regular free from annotations. With this, we do have the pronunciation, we also 
have the original English word, and we also see the parts which have a regular 
pronunciation and which parts do not. The idea is to devise a transformation 
code, in such a way that using these rules and the annotations we can extract 
the right pronunciation from a word spelling. The only caveat is that, as we will 
see, even with a well-established set of annotations and rules, an English word 
can be annotated in several different ways to get the same pronunciation. The 
solution, as we will see, is to use the way which minimises the annotations (or 
some measure of the annotation cost). 

As a result, ''Annotated English"' is: 

— A system of diacritical annotations which are placed around (above and 
below) the words to indicate its true pronunciation. So it follows the old 
motto "pronunciation without respelling" . 

— A set of rules which are designed to reduce the proportion of required anno- 
tations. This keeps the text as clear as possible but, very importantly, helps 
to understand the spelling rules in English. A reader can focus on the general 
rules since exceptions are just annotated. 

— While reading ''Annotated English^, you are reading English. Annotations 
can be ignored by a proficient reader. 

— Every new word which is learned when reading an annotated book comes 
with both its English spelling and its pronunciation. When a person reads in 
''Annotated English'', she can improve and learn both spelling and pronunci- 
ation. Both things at the same time and in the same space. 

— No rule can depend on the meaning of the word, its position or its kind 
(plural/singular, word origin). For instance, annotations must be necessary 
different for homographs which are not homophones, as in 'a present' and 
'to present', or 'to read' and 'have read'. 

— Because of the previous item, the pronunciation (interpretation) process can 
of course be automated, so an annotated document can be pronounced au- 
tomatically by a robotic or a computer system^. 

^ There are many voice synthesisers which have disambiguation tools so 'I read now' 

and 'I read yesterday' are well pronounced. However, these synthesisers fail in other 
more complex cases, so being amusing for some users but crossing some others (e.g. 
blind people). This would not happen for an annotated text since disambiguation 
would not be needed here. Apart from that, there are intonation issues in voice 
synthesisers we are not concerned here. 
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— The annotation (coding) process can also be automated^, so electronic doc- 
uments and books can be converted from plain (traditional) English into 
^Annotated English"', to be printed on paper or read over an e-book or a 
computer screen. 

— It tries to be compatible with most English dialects, while also allowing 
different annotations in case a word is pronounced differently. 

— Diacritical symbols try to reuse many symbols which are common in other 
languages or even in English. Additionally, although we introduce diacritics 
which span over more than one symbol, there is always an equivalence to 
one-character annotations. This eases typographic issues when editing and 
printing annotated documents. 

Regarding the accents, the annotations must be able to distinguish between 

sounds which are different in at least one dialect. For instance, the derived pro- 
nunciations of 'cot' and 'caught' must be different, even though some speakers 
do not distinguish them. Other speakers may merge 'caught' and 'court', and 
similarly we may find homophones in some dialects which do not take place for 
others, such as in father/farther, formally/formerly, tune/toon, Lenin/Lennon, 
etc. Rhotic accents and their way to pronounce the vowels before an 'r' and the 
'r' itself suffer from many mergers. Our system can distinguish between 'merry', 
'marry', 'Mary', 'fairy' and 'merely', even though some speakers can pronounce 
'merry', 'Mary' and 'marry' in the same way. It also distinguishes between 'hurry' 
and 'furry'. This may give the impression that wc favour Received Pronunciation 
(RP, the standard pronunciation for English in England) over General American 
(GA, the standard pronunciation for American English). However, this is just a 
consequence of RP having more phonemes than GA. In some other cases, the 
annotation must give preference for one dialect, and we have to decide whether 
we annotate 'glance' with the 'a' in 'father' or with the 'a' in 'cat'. 

We do not distingiiish, however, between strong and weak forms for words 
such as 'the', 'a', 'to', 'and', 'of, etc. Even though, in some cases, the weak form 
is preferred (especially for 'the' and 'a'), we assume every word is independent 
and we annotate it as if it were pronounced independently. Consequently, we 
annotate the word 'you' as ^yQu^ ([jui]), even though it can be pronounced as [ja]. 
Nonetheless, our annotation system typically changes the default pronunciation 
of vowels depending on whether they are stressed or not. Consequently, our 
system is implicitly consistent with the strong and weak forms for many words. 
For some very common words such as the word 'the', we have incorporated a 
rule (by the way, a rule that any English speaker knows). 

Once explained what our proposal is, it is still convenient to state what 
■"Annotated English"' is not: 

— A spelling reform. English spelling is left untouched. 

— A phonetic transcription. Most of the spelling is reused to make up the final 
picture. Many words do not require annotations. 

* Disambiguation tools would be needed here, but note that a text have to be anno- 
tated once but then it can be read thousands of times. 
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— A set of rules for computer applications. Even though an important fac- 
tor for the success of this proposal is to provide computer programs which 
automatically annotate books and documents, this will be just a tool (an 
important one), but not the core of the project. '"Annotated English^ can 
be used to teach and learn English pronunciation and spelling, without the 
computer annotator. 

Wc have mentioned the advantages and applications of ''Annotated English'' in 
the abstract. Perhaps the most direct application is for non-native speakers, who 
struggle to learn the correct pronunciation of many words they have read many 
times but only occasionally or carelessly heard. In fact, native speakers have to 
suffer the terrible pronunciation of beginners, which are unable to coordinate 
their written English with their poor spoken English (or vice versa). A great 
proportion of members in international organisations, conferences, multinational 
companies, etc., come from non-English speaking countries. Even though they 
have a good proficiency of written English (in some cases better than many native 
speakers), their prommciation is not only incorrect, but full of particularities 
from their own languages, which makes their cross understanding more difficult 
(even for the native speakers). 

For native speakers, this may be useful as well. According to several studies 
(e.g. [6]), children take more time to read in English than those in other lan- 
guages such as Spanish, German or Italian. Explicitly connecting spelling with 
pronunciations through a set of rules and symbols could be very useful for chil- 
dren^. Furthermore, even after years of school, (adult) native speakers still do 
have difficulties with foreign words and technical words, and some words are pro- 
nounced differently depending on their region or their social level. The spelling 
system in English does not put many constraints on how a word should be pro- 
nounced, given its spelling^. This has produced that many new, commercial, 
foreign and technical words have different pronunciations. For instance, words 
such as 'Linux', 'Lego'®, 'genre', 'evolution', 'Neanderthal', 'amino' have several 
pronunciations. 

One of the key issues in this proposal has been to find a compromise between 
rules and required annotations. Of course, rules have to be learned (or inferred 
from annotated text). Thus, in order to avoid this effort, we could just explain the 
meaning of each annotation and use them profusely (because there would be no 
rules at all to rely on in order to save up some annotations). This would imply 
that most words would require several annotations. In fact, every vowel and 
many consonants would require the annotation at every position. Alternatively, 
we could make an exhaustive set of rules in order to avoid as many annotations 
as possible. The problem of this approach is that we may finally make up a much 

® Phonetic proposals to reading (which were very popular in the 1960s), such as the 
Initial Teaching Alphabet [3] were unsuccessful partly because the system should be 
abandoned and switched to 'real' English at some stage of the process. 

® In fact, it can be argued whether English spelling is something in between ideographic 
writing systems and phonetic writing system, with words in English being mere 
mnemonics. 
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too complex set of rules. This would make the reading of an annotated text a 
difficult task, apart from the time which would be required to learn the rules. 

The idea is to find a trade-off between the two extremes. We will develop a 
set of rules, as general as possible, many of which would be recognised by any 
English speaker, in such a way that they can be learned quickly. This short set 
of rules could reduce the number of annotations significantly, without causing 
ambiguities or confusions in the interpretation process. 

Table 1 shows the difference between an annotated text without rules (left) 
and an annotated text with rules (right). Wc have to say that it is not only a 
question of economy and clarity. The version with rules helps the reader assimi- 
late the pronunciation and spelling rules of English, so helping to bridge the gap 
between written and spoken English. 



Annotated text without rules 


Annotated text with rules 


It was the best of times, it was the worst 
of timgs, it was the agg of wisdom, it was 
the agg of foglishness, it was the epoch of 
belief, it was the epoch of incredulity, it 
was the season of Ligjjt, it was the season 
of Darkness, it was the spring of hopg, it 
was the winter of despair, ... 


It was the best of times, it was the worst 
of times, it was the age of wisdom, it was 
tlic age of foolishness, it was the epoch of 
belief, it was the epoclj of incredulity, it 
was the season of Light, it was the season 
of Darkness, it was the spring of hope, it 
was the winter of despair, ... 



Table 1. Difference between an annotated text witfiout rules (left) and our current 
proposal using rules (right) 



It is important to distinguish between the process of reading (interpreting) 
an annotated word and the process of annotating (coding) a word, as well as 
identifying their inputs and outputs, as shown in Figure f. Reading an anno- 
tated word can be done very easily and only requires a quick look at the rules 
and some practice with a few pages of text. Interpreting an annotated text takes 
the original English word and its annotations as inputs and produces its pro- 
nunciation. This process in unambiguous and has only one possible outcome. 
Conversely, coding a word is more cumbersome, because several annotations are 
possible. The good thing is that this task is not performed by the reader. Given 
an English word and a pronunciation, an annotator (a person or a computer 
program) has to choose between the possible annotations. In general, there are 
not many options, so given a word and its pronunciation, we can establish some 
criteria or set of scores in order to choose the best (or standard) annotation for 
the word. In fact, this process can be performed automatically by a computer 
program if we have access to the IPA transcriptions of all the English words 
appearing in a text. 
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Interpretation: 

English word + annotation — > pronunciation 
Example: 'locomotion' — > IPA: [|louka'mouJon] 



Coding: 

English word + pronunciation — > English word + annotations 
Example: 'locomotion' + IPA: [,louka'moujon] — > 'l^comcption', 'lo;:;om(j)tion', 'locom^otion', 

'locomotion' 



Standard Coding: 

English word + pronunciation + annotation costs — > English word + annotation 
Example: 'locomotion' + IPA: [,louka'moujon]) + (equal symbol cost) — > 'locomotion' 

Fig. 1. Inputs and outputs when interpreting and annotating texts. 

This document is organised as follows. In section 2 we present the set of 
symbols we will use and their basic meaning. We also include some IPA notation 
and briefly explain how the annotation symbols look like and how they can be 
produced in M^^X (you can ignore the M^jX commands if you are not going 
to use I^T[t;X as a text editor). In section 3 we present the rules to follow the 
annotation, i.e. the interpretation process. In section 4 we discuss the problem 
of annotating a document. Section 5 includes some annotated texts, which are 
very helpful to see how ^Annotated English"' works in practice. In section 6 we 
conclude the paper with a discussion about some preliminary statistics, whether 
the compromise between rules/annotations is correct, dialects, typography, the 
use of diacritical annotations, and, of course, future work. Appendices include 
alternative rules we finally did not implement, and some I^T^jX stuff. 

2 Symbols 

Here we introduce the symbols we will use, their meaning and how to use them 
in WT^. 



2.1 Phonetic Symbols 

For phonetic transcriptions, we follow the conventions from the International 

Phonetics Alphabet (IPA). We will use the symbols shown in table 2 (conso- 
nants and semiconsonants) and 3 (vowels and diphthongs). The only particular 
notation which differs from IPA is that we will use the notation [k] to represent 
either [j] or [o(j)]. This is useful to generally distinguishing between 'flower' and 
'flour', since the first is [flauaj] in GA and [flaua] in RP, while the second is 
[flauj] in GA and [flaua] in RP. With our notation, 'flower' is [flauaj] and 'flour' 
[flauK]. We use [j] to represent the special sound of the 'r' in English in contrast 
to other languages. In general, if we are only treating with English, we can use 
simply [r]. 

Using IPA, we can describe the pronunciation of a word as shown with the 
following example: 

explanation —> [.ekspla'neijan] 
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£}xcLiTiple / Ejxplan&tion 


lyiEA. commana (_\text;ipaj^ L- • - Jj/ 


[p] 


pill 


[p] 


[b] 


6it 


[b] 


[t] 


fen 


[t] 


[d] 


dive 


[d] 


[g] 


50 


[g] 


[k] 


cat 


[k] 


[m] 


mine 


[m] 


["] 


;/i(:c 


[11] 


[rj] 


sing 


[N] 


[r] / [A 


ray 


[r] / [\*r] 


[f] 


fine 


[f] 


[v] 


line 


M 


[9] 


thin 


[T] 


[3] 


them 


[D] 


[s] 


song 




[^] 


200 


M 


[J] 


shine 


[S] 


[3] 


leisure 


[Z] 


[tj] 


chin 


[tS] 


[d3] 


magic 


[dZ] 


[h] 


he 


[h] 


[x] 


loch 


W 


[j] 


yes 


P] 


[1] 


line 


[1] 


[w] 


ue 


[w] 


H 


when (some dialects) 


[\*w] 


W\ 


eoiupuiiT (some diali'ct.s) 





Table 2. IPA phonetic symbols (consonants and semiconsonants) . Adapted from 
http : //en . wikipedia . org/wiki/English_orthography 
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IPA 


ExciinpiG / Explciiicitiori 


TATt^X command Vtexticalf '\\\ 


rn 

LJ 


Start of stressed syllable. 


p'l 

[ J 


r 1 

L.J 


Start of secondary stressed syllable. 


[ j 




1 

be 


r;.l 


W 


bit 


FTl 




tool 


[u.J 




look 


FT Tl 


[eij 


rate 


[elj 


r^i 


anth em 


[®J 


[ou\ 


bone 


[oUJ 




met 


FT?! 




hand 


[\aej 


r, 1 


SMn 


roi 


[o:J 




[0:J 


r^i 
[dJ 


lock 


\a\ 

[oj 


[a:J 


father 


r A .1 
[A.J 


[aij 


fine 


[alj 


[oi] 


boy 


[OI] 


[au] 


OMt 


[aU] 


[ju:] 


muse 


[ju:] 


[a.] 


car 


[Ar] 


[3.] 


fem 


[3r] 


[0.] 


door 


[Or] 


[e(o).] / [eh] 


hair 


[E(@)r] / [EK] 


[i(e).] / H 


h ere 


[I(@)r] / [IK] 


[ai(8)j] / [aiK] 


fire 


[al(@)r] / [alK] 


[U{^)X] 1 [UK] 


moor 


[U(@)r] / [UK] 


[j«(a)-i] / 


pure 


pU(@)r] / [jUK] 


[ja] 


failure (reduced unstressed u ) 


m\ 


[e] 


esoteric (reduced unstressed o) 


[8] 


[«] 


beautiful (reduced unstressed u) 


[0] 


H 


lentil (reduced unstressed i) 


[1] 



Table 3. IPA phonetic symbols for vowels and diphthongs. Adapted from 
http :! lea., wikipedia . org/ wiki/English_orthograpliy 



10 



Some words have different pronunciations depending on the accent. The previous 
table is not exactly phonetic but pseudophonetic (they try to generalise several 
English accents), and it is custom for many English dictionaries, which only need 
to include different pronunciations when they are not inferrable from the accent 
(e.g, the two different pronunciations for the word 'either'). 

We incorporate a slight modification in the way stress is coded in IPA. Instead 
of placing the stress at the beginning of each syllable, we place the stress at the 
first vowel in the syllable. This avoids the elicitation of where syllables start or 
end, an issue which is not strictly necessary for pronunciation, especially when 
the word is read at a normal pace. The following example shows this modification: 

explanation [.eksplan'eijan] (instead of [.ekspla'neijan]) 
Note that this does not mean that we have to split the word into 'explan-ation'. 
It is just that we place the stress on the vowels, since each syllable has only one 
vowel group. 

2.2 Annotated cind Non-einnotated Text 

It is generally obvious to tell annotated from non-annotated text and vice- versa, 
since we see some symbols over and beneath the letters in one case and none in 
the other. Nevertheless, in some cases, a word or a part of a document can be left 
non-annotated. In order to avoid confusion, we need to make; that clear. That 
can be the case with words that have more than one possible pronunciation (and 
the annotator does not want to choose among them). Another case is when the 
annotator (which might be an automated system) docs not know the annotation 
for a word. Finally, another case is for foreign words, especially for places and 
names. 

Non-annotated text indicator: We use two symbols for opening 
(\nonl{}: J and closing (\nonr{}: J. 

Examples: The first day in Spain, we stayed in a small village called 
^ZaragiiCj, near ^Setfas^. 

Similarly, in a non-annotated text, we could annotate some difficult or technical 
words. 

Annotated text indicator: Wc; iisc^ two symbols for op(;ning 
(\annl{}): j uud closing (\annr{}: j. 

Examples: The word 'catastrophe' is pronounced ^catastrophe"'. 



2.3 Silent Letters, Stress and Separators 

One of the most useful annotations is to make a letter silent. 

Silent: Any letter can be eliminated (it becomes silent) with the symbol 

(\si{a}): a- 

Examples: ghost, whergas. 
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A stress symbol can be used to locate stresses for words with more than one 
syllable. 



Stresses: Main stress (\st{a}): a. Secondary stress (\stst{a}): a, 
Examples: ago, fundamental, anthology 

We have five different separators, which are useful to break digraphs and to 
separate compound words. They may also have an effect on stress. 

Unstressed separator: The word is broken but stress is not modified: 
(a\se{}e): ae, and we can use more than one in a word. The rules for 
the stress do still see the word as a whole. The main use of this is to 
break digraphs. 
Examples: parenthood. 

One-stress separator: Only the left part is stressed (a\sel{}e): ae. 
Only the right part is stressed (a\ser{}e): ae. 
Examples: gomg, away. 

Two-stress separator: Both parts are stressed, but the main stress 
goes to the left part (a\selr{}e): ae. Both parts are stressed, but the 
main stress goes to the right part (a\serl{}e): ae. 
Examples: northbound. 

More Examples: fathe9,d, homeless/homgless/homeless/homeless, 
sidecar /sidgcar, awhile, althoygh, outgoing. 

Occasionally, me may need to introduce a schwa (an unstressed central vowel) 

Schwa: (a\sch{}b): a'b. This can be done; without introducing the; tiny 
HpMC(> l)('t\v(>cn lell(>rs as (\ssch{ab}): al). 

5 5 

Examples: autis'm / autism (only some accents), met're / metre (this is 
not needed since there is a rule which fixes this) 



2.4 Vowels 

Vowel pronunciation requires many annotations given the reduced number of 
vowel letters in the Roman alphabet {a,e,i,o,u) plus the use of (w and y), com- 
pared to the huge number of vowel phonemes in English. 

English has 17 stressed vowel phonemes (including [wai] and excluding the 
rhotic sounds, i.e., when an 'r' follows) that can be originated from one or two 
written vowels. Fortunately, we do not need 17 different symbols, since one single 
letter can only have at most 10 different sounds (excluding the effect of a fol- 
lowing 'r'). For instance, 'o' sounds differently in these words: pot, note, bought, 
love, Goethe, women, do. woman, now. foie. 

Plain: Single letter (\pln{a}): a. Double letter (\ppln{ae}): a!e is not 
used. Before i, it goes like this: (\pln{\i}): i. 
Examples: anthology, seven, Italy, ugly. 
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Natural: Single letter (\nat{a}): a. Double letter (\nnat{ae}): ac. Be- 
fore i, it goes like this: (\nat{\i}): i. 

Examples: range, equal, ghost, bind, reggae, neither (GA), linite. 

Broad: Single letter (\brd{a}): a. Double letter (\bbrd{ae}): ac. But, 
(\bbrd{a}): a is incorrect. Before i, it goes like this: (\brd{\i}): T. . 
Examples: father, cafe, machine, bought, field, super. 

Clear: Single letter (\clr{a}): a. Double letter (\cclr{ae}): ac. But, 
(\cclr{a}): a is incorrect. Before i, it goes like this: (\clr{\i}): i. 
Examples: some, love, blood, sergent, Magrcb, lingerie. 

Central: Single letter (\ctr{a}): a. Double letter (\cctr{ae}): ae is not 
used. Before i, it goes like this: (\ctr{\i}): i. 
Examples: any, biiry, word, terrible. 

'loted': Single letter (\iot{a}): a. 

Examples: English, ended, women, chicken, busy, message 

Round: Single letter (\rnd{a}): a. Double letter (\rrnd{ae}): ae is not 
used. Before i, it goes like this: (\rnd{\i}): i. 
Examples: all, do, se^, m^uve 

Opaque: Single letter (\opq{a}): a. Double letter (\oopq{ae}): ae. But, 
(\oopq{a}): a is incorrect. Before i, it goes like this: (\opq{\i}): i. 

Examples: watch, woman, put, book, woilld, envelope 

'i-diphthong': Single letter (\idp{a}): a. Double letter (\iidp{ae}): 
ae. But, (\iidp{a}): a is incorrect. Note that (\idp{ae}):ae is incorrect. 

Examples: §ye, Einstiin, paella, f5ie. 

'u-diphthong': Single letter (\udp{a}): a. Double letter (\uudp{ae}): 
ae. But, (\uudp{a}): a is incorrect. Note that (\udp{ae}):ae is incorrect. 
Examples: now, Laos, Freudian. 



2.5 Semiconsonants 

Semiconsonant annotations are special in the way that they can convert any 
letter (or group of letters) into the (semi-) consonants 'y' and 'w'. Additionally, 
if they are placed between letters, they introduce the sound. 

Semiconsonant w : Single letter substitution (\w{a}): a. Introduction 

(W}): ab. 

Examples: suede (substitution), ouija (substitution), one (introduction), 
once (iutroductioii) 

Semiconsonant y : Single letter substitution (\y{a}): a. Introduction 

(\y{}): ab. 

Examples: hallelujah (substitution), tortiUa (substitution), onion (sub- 

" Y Y ^ 

stitution), million (substitution), failure (introduction). 
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2.6 Consonants 

Consonants are more regular and require a fewer number of symbols than vowels. 
Common change: (\co): g. 

Examples: gill, target, Celtic, fajlta, loch (locjj in most accents), LaTeX 

Voiced and voiceless : (\vo, \iio): s, si 

Examples: house, thing, scissors, topped 

Soft voiced and voiceless: (\svo, \sno): x, si 

Examples: notion, vision, luxury, migsion, mission (but it would be better 
as mission) 

Hard voiced and voiceless : (\hvo, \hno): g, ti 
Examples: celo, nature, question, soldier 

In some cases, especially when grouping more than two letters with the same 
annotation, the graphical representation of the annotation may not span all over 
the letters. In this case, we can use a 'group', which is not an annotation, but 
a graphical/typographical sign to better understand that the annotation affects 

the group of letters. 

Grouping: Useful to group two or more letters, before applying the 
annotation. It is generally applied for consonant annotations, but it can 
also be applied to the 'silent' symbol '"'. (\group); abc. 
Examples: mission, exaggerate, tortilla, yacht, lough. 



These are the symbols we will use to convert any English text into an annotated 
text such that its pronunciation can be derived unambiguously. Wc have not 
included here an exhaustive explanation of the sounds of every annotation. Doing 
so would allow the annotation of any text. But, without any rule, this would yield 
verbose annotations as shown on the left of Table 1. In the following section we 
present a set of rules to dramatically reduce the number of annotations. Thus, 
the exact IPA phonetic of each combination of letter (or digraph) + annotation 
will be precisely defined next. 

3 Rules of Pronunciation for Annotated English 

The rules wc present in this section will allow a reader to unambiguously deter- 
mine the pronunciation of an annotated word. The basic idea is that the words 
(or parts of words) who have no annotations will be pronounced as the rule says. 
The words (or parts of words) with annotations will highlight an exception to 
the rules, and the annotation will clarify the irregular sound it corresponds to. 
For instance, the word 'that' requires no annotation. First, the digraph 'th' here 
is voiced, which follows a rule saying that it miist be so when appearing before 
a vowel. Second, 'a' is in a plain position (followed by a consonant at the end of 
the word). Third, 't' is regularly pronounced. Quite differently, the word 'house' 
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requires an annotation, because 'h', 'ou' and the final 'e' sound as the rules indi- 
cate, but the consonant 's' must be annotated as voiceless since this case breaks 
the rule that an 's' is voiced between vowels. 

The rules must be applied in a given order. We arrange the process in 12 
steps, as follows: 

1. Discard (crossed-out) letters. 

2. Identify word segments by hyphens, apostrophes and separation annotations 

3. Parse critical digraphs ('ng', 'ng', 'gh', gh, 'wh', 'wr', 'qu', 'gu'.). 

4. Distinguish vowel and consonant groups. Handling semiconsonants 

5. Distinguish vowel units in groups using digraphs (aa, ae, ai, ay, au, aw, ea, 
ee, ei, ey, eu, ew, oa, oi, oy, oo, ou, ow). 

6. Place the stresses for each vowel unit. 

7. Categorise vowel ocurrences (stressed/ unstressed, natural/plain and rhotic/ non- 
rhotic) . 

8. Evaluate vowel units. 

9. Distinguish consonant units in groups using digraphs (kn, pn, gn, cn, ph, ch, 

sh, ps, rh, pt, th, gg, ss). 

10. Evaluate consonant units. 

11. Recompose the words. 

12. Postproccss the IPA result (elimination of repeated consonants and inclusion 

of schwa into final -#r). 

The example in Figure 2 shows how an annotated sentence is converted into IPA 
following the previous process. 

Next, each step of the process is explained in one subsection each. 

3.1 Discard (crossed-out) Letters 

The first step of the process is easy. As we saw, the symbol 'J indicates that the 
letter is crossed out (and hence becomes silent), which means it is deleted and 
discarded for any subsequent rule. For instance, 'pot' and 'pot§' are considered 
equal. 

Examples: cupbgard ([kAbard]), dum^ ([dAm]), arg ([ar]). 
Once silent letters are removed we proceed to identify groups and segments. 

3.2 Identify Word Segments. Hyphens, Apostrophes and Separators 

We use the notions of groups and segments, instead of a notion of syllable, which 

is not used in our annotation system. Segments arc independent parts of a word. 
Words only have one segment, unless they are broken by a written hyphen (e.g. 
two-way), an apostrophe (e.g. it's) or by a separation annotation ('', '', '', 

When an apostrophe has a right-hand side starting with an 'r' (you're, 
they're, etc.), only one segment is created, in order to consider the possible 
effect of the 'r' over the vowels on the left-hand-side. 
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(0) Nobody's questioning that Australopithecus is a genus of hominids that arg nowadays extinct. 

(1) Nobody's questioning that Australoj>ithecus is a genus of hominids that ar nowadays extjnct. 

(2) (Nobody) (s) (questioning) (that) ([jAustralo), (['Jpithccus) (is) (a) (genus) (of) (hominids) 

(that) (ar) (nowadays) (extinct). 

(3) (Nobody) (s) (kwestionin) (that) ([JAustralo), (['Jpithecus) (is) (a) (genus) (of) (hominids) 

(that) (ar) (nowadays) (extinct). 

i 

(4) (N,o,b,6,d,y) (s) (kw,c,sti,o,n,i,y) ([']^h,a,t) ([,]Au,str,a,l,o), ([']p,i,th,c,c,u,s) (i,s) (a) (g,e,n,u,s) 

(o,f) (h,6,m,i,n,i,d,s) (th,a,t) (a,r) (n,6wa,d,ay,s) (c,xt,i,nct). 

(5) (N,o,b,6,d,y) (s) (kw,e,sti,o,n,i,n) ([']th,a,t) ([,]Au,str,a,l,o), ([']p4,t_h,c,c,u,s) (i,s) (a) (g,e,n,u,s) 

(o,f) (h,6,m,i,n,i,d,s) (th,a,t) (a,r) (n,6w,a,d,ay,s) (e,xt,i,nct). 

i 

(6) (N,[']o,b,6,d5y) (s) (kw,[']e,sti,o,n,i,n) (th,[']a,t) ([,]Au,str,a,l,o), p,([']i,th,e,c,u,s) ([']i,s) ([']a) 

(g,[']e,n,u,s) ([']o,f) (h,[']6,m,i,n,i5d5s) (th,[']a,t) ([']a,r) (n,[']6w,a,d,ay,s) (e,xt,[']i,nct). 

i 

(7) (N,[']o,b,6,d,y) (s) (kw,[']e,st„i,o,n,i,n) (th,[']a,t) ([,]Au,str,a,l,o), p,([']i,th,e,c,u,s) ([']i,s) ([']a) 

(g,[']e,n,u,s) ([']6,f) (h,[']6,m,i,n,i,d,s) (th,[']a,t) ([']a,r) (n,[']6w,a,d,ay,s) (e,xt,[']i5nct). 

i 

(8) (N,['ou],b,[D],d,[i]) (s) (kw,['s],stj,[3],n,[i],n) (th, ['a;] ,t) ([,3:],str,[s],l,[ou]), (p,['i],th,[9],c,[3],s) (['i],s) 
([ei]) (g,rh],„,[3],s) ([d],!-) (h,[D],m°([i].„.([i],ds) (th.['ao],t) ([arl.r) („,['au],[3],d,[ci],s) ([i],xt,[i],nct). 

i 

(9) (N,['ou],b,[D],d,[i]) (s) (kw,['E],st_i,[a],n,[i],n) ([3],['a3],t) ([,o:],str,[8],l,[ou]), (p,['i],th,[9],c,[a],s) (['i],s) 
(['ei]) (g,['i:],n,([a],s) (['D],f) (h,['D],m,[i],n,[i],ds) ([8],['ae],t) (['ar],r) (n,['au],[a],d,['ei];s) ([i],xt,['i],nct). 

i 

(10) ([n],['ou],[b],[D],[d],[i]) ([z]) ([kw],['e],[stj],[a],[n],[i],[r,]) ([S],['ae],[t]) ([,D:],[str],[8],[l],[ou]), 
([p]>H>[e],N,[k],(N,W) ([■«]) ([d3],['i:],[n],([8],[s]) (['d],[v]) (h,['D],[m],[i],[n],[i],[dz]) ([8],['ail,[t]) 

([■ar],[r]) ([n],['au],[a],[d],['ei],[zl) 
([i],[kst],['i],[nkt]). 
i 

(11) ([n'oubodi-z kw'estjomrj S'aet .Distraloup'iSakas 'iz ei dg'iinos 'nv h'ommidz S'aet 'arr n'auadeiz 

ikst'mkt]). 

(12) ([n'oubodi-z kw'estjami) d'^t ,o:straloup'i6akas 'iz ei dg'iinas 'dv h ommidz 5'aet 'ar n'auadeiz 

ikst'mkt]). 

Fig, 2, An example of the process of interpretation following the rules. 
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Examples: two-way (two-way), it's (it-s), Tom's (Tom-s), I'll (I-U), ygu're 
(ygure), Ireland (Ire-land), including (including), policymaker (policy-maker), 
going (go-ing), undergo (under-go), fathead (fat-he§,d). 

Segments are very important to break possible digraphs or to break up com- 
pound words, as we have seen in some examples above. 

3.3 Parse Critical Digraphs 

There are some consonant digraphs that deserve special rules at this early mo- 
ment of the process, because they become silent or partly silent, or they involve 

a semiconsonant (and we need to distinguish between consonants and vowels the 
sooner the better). The rules arc as follows and arc applied in this order. 

1. 'ng', 'ng'. The digraph 'ng' becomes 'n' when it is at the end of a segment, or 
before any consonant except '1' and 'r'. In other words, we have a softer [q] 
before any combination which makes impossible to articulate the 'g' sound. 
Examples: 'sing', 'yQungster', 'Sing}}'. In all the other positions, the digraph 
'ng' becomes 'nj' ([nd3]) if it is followed by 'e', 'i' or 'y' (e.g., ginger, angel, 
spongy) and 'ng' ([qg]) in all the other cases ('bingo', 'finger', 'English', 
'single', 'angry'). Note how the words 'singer', 'bringing' and 'clingy' are 
separated into two segments in order to get the proper sound. When 'g' is 
annotated as 'g', the digraph 'ng' is always converted into 'ng' ([rjg]) (e.g. 
'anger'). When 'g' after 'n' is annotated as 'g' (i.e. 'ng'), it is always ([nds]) 
(e.g. 'dungeon'), and when 'g' is annotated as 'g', it is always pronounced 
[ns] as in 'lingerie'. In these two cases ('ng' and 'ng') we do not need to 
perform any action at this moment of the process. 

2. 'gh', gh. The digraph 'gh' becomes silent everywhere. For instance, it be- 
comes silent for words such as 'high', 'higher', 'though', 'thought', 'highness', 
'neighbour', while it does not for 'tough', 'bi^heg,ded'. At this moment of the 

process, we also apply the annotation gh, which makes 'gh' convert into 'a', 
as in 'Edinburgh'. In other cases, where the digraph has been previously de- 
stroyed, there is no need to apply this rule, such as in 'ghost' or 'Maghreb'. 
The annotations gh, gh are processed later in the process. 

3. 'wh'. The digraph 'wh' becomes 'w' (that will be finally transcribed into 
([w] or [av] depending on the dialect) at the beginning of a segment or when 
it follows a consonant. Examples: 'handwheel', 'when', 'awhile, worthwhile, 
nowhere, everywhere, somewhere, but not with 'cowhide' (because it follows 
a vowel and also because it is annotated). When annotated as 'wh', this is 
left for later steps of the process (and will be finally transcribed into ([av])). 
Of course we do not have to care when the digraph has been previously 
destroyed, as in '^ho' or '-^hole'. 

4. 'wr'. The digraph 'wr' becomes 'r' except when it comes after a vowel 
a,e,i,o,u. Examples: 'write', 'wrong', 'shipwreck', 'rewrite' (because of the 
separation), 'awry' (because of the separation). It does not disappear in 
'Jewry', 'arrowroot', dowry. 
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5. 'qu'. The digraph 'qu' is always understood as 'kw' before a,e,i,o,u,y, and 
does not need any annotation for the 'u' to sound as a semiconsonant, as in 
'quahty', 'quo', 'quest', 'equal', 'liquor', 'aqua', 'barbeque' (better 'barbecue'. 
If the 'u' does not sound, it has to be crossed out ('opaqye', 'critique)', 
'queye', 'qyesadiUa'). 

' Y 

6. 'gu'. The digraphs 'gu' is always understood as 'gw' before a,o,u and as 'g' 
before e,i,y. For instance, the 'u' is converted into the semiconsonant sound 
'w' in 'guat', 'Guatemala", 'guano', 'ParaguSy', 'language'. It is silent in 
'guess', 'guest', 'baguette' 'guineg,', 'disguise', 'guide', 'guilty', 'guitar', 'guy' 
(which becomes 'gy'), 'Portuguese', colleague, analogue (note that the 'u' 
becomes silent at this stage of the process, while the 'e' will become silent 
by a rule we will explain below), vogue, tongue, fatigue. The 'u' is kept as 
a vowel when it is followed by a consonant or it is annotated, as in 'gun', 
'langyur', 'gubernatorial', 'ambiguous', 'ambigiiity', 'penguin', 'distinguish', 
'argiae'. The 'u' can also become silent before an 'a', but in this case, following 
the rules, it requires a cross-out annotation, such as in 'guard', 'gyarantee'. 

7. 'ah', 'eh', 'ih', 'oh', 'uh', 'yh', 'wh'. These digraph make the 'h' silent before 
a consonant. In other words, 'h' is silent between vowel and consonant. Ex- 
amples: ah, eh, oh, ooh. uh, huh, Phar§,oh, Pharaohs, Shah, Yahwch (note 
that 'Yajjweh' would create the digraph 'aw'), yeah, Sarah, Cohn, Kahn, 
John, Kiihn, Nahcolite, Fahd, Dahl, Kohl, buhl, Lehman, ohm, Bahrain, 
Bohr, Riihr. 

It is interesting to remark (again) that the previous rules are applied in the order 
they appear. For instance, the word 'bilingual' is converted by the 'ng' rule into 
(bi-lingual) . The 'gu' rule finally converts it into (bi-lingwal) There are more 
digraph rules for vowels and consonants, but they will be seen when vowels and 
consonants are processed. Wc have placed some of them here, because it is very 
important to consider 'w' a semiconsonant and not a vowel in some situations, 
and the 'ng' rule must come before the 'gu'. The 'gh' rule is given now to consider 
the introduced vowel gh, and to make the natural sound for 'i' in 'high'. 

3.4 Distinguish Vowel and Consonant Groups. Handling 
Semiconsonants 

After the previous process, we are closer to distinguish between vowels and con- 
sonants, but there are still some details about 'y', 'w' and other vowels becoming 
consonants (and viceversa) that we need to consider. 
It is traditionally understood that: 

— Vowels in English are a, e, i, o, u. 

— Consonants in English are h, c, d, /, g, h, j, fc, Z, m, n, j>, q, r, s, t, x, z. 

— The letters w and y are considered vowels in some cases and (semi)consonants 
in other cases. 
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Apart from clarifying w and y, we find cases where u becomes a (semi) consonant 
in some special cases (apart from 'qu', 'gua', 'guo', 'guu' and eventually when 
annotated after other consonants), and some other annotations converting vowels 
into consonants and vice-versa. 

We start analysing when w and y are vowels and when they are consonants. 
The rule is as follows. The letters w and y arc vowels if they arc before a conso- 
nant, at the end of the group, they are part of a vowel digraph {ay, ey, oy, aw, 
ew, ow), or they are affected by a vowel annotation. Examples: 'cwm', 'lynx', 
'my', 'cry', 'ray', 'delayed', 'boyfriend', 'buying', 'flowing', 'eye' arc vowels, while 
'yell', 'lawyer', 'when' (remember that this arrives as 'wen' and not 'w-hen' at 
this moment of the process'), 'will', 'klwT', 'beware' are consonants. Note that 
if w or y arc crossed out, there is no need to decide, such as in 'whole'. Also, y 
is considered a vowel if found after one or more consonants. For instance, y is 
a vowel in 'bye', 'cyan', 'Libyan', 'Byelorussian', etc. This is not applied for w 
('twine'). We can combine y and w as in 'Wyoming', where we have that 'w' is 
a consonant and 'y' is a vowel. 

There are several annotations that may convert a vowel into a consonant. 
If the letter 'u' is annotated as 'u', it is transformed into a (semi) consonant 
'w'. Examples: the 'u' in 'penguin', suave, suede, puerto becomes the semicon- 
sonant 'w' (remember we do not have to annotate Paraguay, but the 'u' also 
becomes 'w'), while the 'u' in 'gun', 'guru', 'dual', 'fuel', 'duo' is clearly a vowel. 
Note that we do not have to care about cases where the 'u' has been previ- 
ously deleted, such as 'guarantee'. Remember that the digraph 'qu' is always 
understood as 'qw' before a,e,i,o,u,y, and does not need any annotation, as in 
'quality', 'quo', 'quest', 'queen', 'quasi', 'cquar, 'liquor', 'aqua', 'colloquy'. But 
note that in 'Quran', 'Qtim' (a city), etc., it is considered a vowel. Some words 
taken through Spanish or French may require annotations since the 'u' is silent, 
such as 'Qyechua', 'liqygur' (or perhaps better 'liqucyr'), 'burlcsqyc', 'opaqye', 
'critlqye)', 'qyesadiUa' or a formal pronunciation of 'Qy.ebec'. Some other strange 

Y 

spellings may also require this, such as 'qy§y', 'queye'. As we have also seen, 

'gue', 'gui' and 'guy' make the 'u' silent, while 'gua', 'guo' and 'guu' convert it 
into 'w', unless annotated, such as in 'guarantee' and 'penguin'. 

The groups 'o' and 'ou' in some words may become 'w' as well. This is also 
denoted by 'j. If the sound is completely replaced, as in 'QuTja', 'voyeyr' (RP), 
'choir' we use 'J just below the letter (or the group of letters, 'OuTja', although 
this may be confusing if we do not use the grouping: 'OuTja'). If the vowel sound 
is preserved and the 'w' appears before the vowel, we denote that by but it is 
placed between the previous letter and the vowel, as in 'j3ne', 'anyone' or Qantas. 
Note that we introduce the 'w' sound in our decomposition, so 'jane' becomes 
(w,6,n,e). 

The letters 'i' and 'y' can be transformed into or forced into a (semi) consonant 
'y', by using the annotation For instance, 'onion', 'bunion' (note that 'ni' is 
now considered a complex consonant), Hierro (the Canary island), Kenya (note 

Y 

that even it is a 'y' it would be considered a vowel if not annotated), California, 
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canion, miliey, million. But note that 'Romanian', 'Libya', Iberian, simian, etc., 
maintain the vowel (this also depends on the accent). The same symbol '' can also 
be used to annotate the introduction of the phoneme [j] between the letters where 
it must be placed. Examples: 'failure' (which is better annotated as failure), 
'lasagiia' (the introduction of a new consonant makes the group complex). 

Finally, there is a bizarre u sound which becomes f, as in lieutenant. Some 
other vowels may become consonants in conjunction with other consonants if so 
annotated. For instance, di, de, ti, ... as in soldier or notion. 

After the previous cases, it may seem difficult to tell between consonants 
and vowels. But it is not so if we summarise and put everything together. The 
following set of items clarify (and summarise) this: 

— Vowels in Annotated English are a, e, i, o, u and any group of characters 
which have an annotation above the letter (e.g.. a. 6w, gh). The letters w 
and y are vowels if they are before a consonant, at the end of the group 
or they are part of a vowel digraph (ay, ey, oy, aw, ew, ow). Also, y is 
considered a vowel if found after a consonant. 

— Consonants in Annotated English are b, c, d, f, g, h, j, k, I, m, n, p, q, r, 
s, t, X, z and any (possibly empty) group of characters which have an anno- 
tation below the letter (e.g., f, ^ ou, u). The letters w and y are considered 
consonants when they are not considered vowels. The letter u is considered a 
consonant in the unannotated digraph 'qu', and in the unannotated groups 
'gua', 'guo', 'guu'. 

We will Tise the notation @ for a vowel and # for a consonant (this is notation 
for this explanation, not part of the annotations). 

Only after the previous rules, we are able to decompose each word segment 
into vowel and consonant groups. 

For each segment, we have to distinguish its groups. We use the term vowel 
group for a series of consecutive vowels in a segment. Similarly, a consonant group 
is a series of consecutive consonants. Note; that a vowel group is not necessarily 
a unit (a single vowel sound or a diphthong). For instance, 'ie' in 'science' is a 
vowel group with two units. Similarly, 'doing' has a vowel group '6i' which is 
composed of two units. Some other two- vowel groups can have one unit, such as 
'ow' in 'grow'. Note that groups cannot be made across consecutive segments. 
For instance, the groups in going preclude o and i from being together. 

Examples: consecutive (c,o,n,s,e,c,u,t,i,v,e), science (sc,ie,nc,e), de^t (d,e,t), 
fasj^en (f,a,s,e,n), gomg (g,o- i,ng), diphthong (d,i,phth,o,n), 'cry' (cr,y), 'boyfrjend' 
(b,oy,fr,end), 'byying' (b,yi,ng), 'flowing' (fl,owi,n), 'gye' (gye), when (w,e,n), 
'kTwT' (k,T,w,T), 'beware' (b,e,w,a,r,e), 'cyan' (c,ya,n), 'twine' (tw,i,n,e), 'Guatemala' 
(Gw,a,t,e,m,a,l,a), 'giirii' (g,u,r,ii) , 'ghost' (g-6-st), 'shipwreck' (sh,i,p,r,e,ck), 
'penguin' (p,e,ngu,i,n), 'gyest' (g,e,st), 'quality' (kw,a,l,i,t,y), 'liquor' (l,i,kw,o,r), 
'Qaija' / 'OuTja' (Q21,T,ja) / (u,T,ja), 'away' (a,w,ay). 
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3.5 Distinguish Vowel Units 



The first thing to be done when a vowel group is to be pronounced is to determine 
the vowel digraphs (in case) in order to determine the vowel units. Vowel units 
arc also useful to place the stress, since stress is always on vowel units (a syllabe 
has one and only one vowel unit). 

The vowel pairs which are considered digraphs and those which are not are 
shown below: 

— Digraphs: aa, ae, az, ay, au, aw, ea, ee, ei, ey, eu, ew, oa, oi, oy, oo, ou, ow. 

— Non-digraphs: ao, eo, oe, i@ {ia, ie, ii, iy, io, iu, iw), and u@ {ua, ue, ui, 
uy, uo, uu, uw). 

There are no vowel trigraphs. A vowel group with only one vowel is a vowel unit. 
When two or more vowels are found together in a group, we form digraphs from 
left to right (so, 'eei' becomes (ee,i) and not (e, ei)). The parts which are not 
grouped by a digraph are left as single units. Note that any deleted vowel is not 
considered for digraphs, since they are deleted by a previous rule. 

For instance, 'bait' applies the digraph and 'ai' is maintained as a unit (b,ai,t). 
The word 'science' becomes (sc.i.cne.c;) because 'ie' is not a digraph. If a vowel 
in a group has an annotation, this breaks any possible digraph. For instance, 
'yeah' breaks the possible 'digraph' ea and then becomes (y,e,a,h). If the second 
vowel has a stress annotation it also breaks the unit (oasis). When an anno- 
tation spans over two vowels, it means that these two vowels make a unit, as 
in 'field' (f,ie,ld). When a stress annotation is placed over the second letter of 
a vowel group, than the vowel group is split into two units (e.g., suprainguinal 
(s,u,pr,a,i,ngu,i,n,a,l). A silent annotation between two vowels put them together 
(e.g., 'vehicle' (v,e,i,cl,e) must annotate the first 'e' to avoid the interpretation 
(v,ei,cl,e). It is the same for the silent digraph 'gh', but luckily in words such as 
'higher' we have the cluster 'ie' which is not a digraph (h,i,e,r). And, of course, 
separators break units (e.g. away) 

Examples: 'going' (g,o - i,ng), 'buying' (b,uy,i,ng), 'flowing' (fi,ow,i,ng), 'see- 
ing' (s,ee,i,ng) 'lion' (l,i,o,n), 'new' (n,cw), 'Eye' (5y,c), 'algae' (a,l,g,ac), 'miaow' 
(m,T,ow), 'Halloween' (H,a,ll,ow,ee,n), 'meow' (m,e,ow), 'shoe' (sh, 6, e), 'cochleae' 
(c,o,c,l,e,ae), 'aeplian' (ae,o,l,i,a,n), HawSi (H,a,w,5i,i) (note how the cluster 'aw' 
is broken), 'Edinburgh' (E,d,i,nb,u,r,gh), 'gluey' (gl,u, ey), 'gluier' (gl, u, i, e, r). 

Following the previous rules, we realise that we can only have vowel units 
with one or two vowels, never more. 

3.6 Place the Stresses for Each Vowel Unit 

Stress applies to vowel groups and ultimately to vowel units. Here we account 

to placing the stress over the vowel groups. There are stressed segments (which 
can be either primary or secondary) and unstressed segments. We follow a set 
of rules, in this order: 
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1. If a word is not separated into segments, it is a primary-stressed segment 
(e.g., 'house', 'difference', 'ago'). 

2. If a word is separated by a hyphen, only the first segment is considered 
primary-stressed and the second is unstressed (e.g., 'yo-yo')). 

3. If a segment is separated by an apostrophe, only the first segment is con- 
sidered primary-stressed and the second is unstressed (e.g., 'it's'). This rule 
can be broken if the apostrophe is annotated: o'clock. 

4. The segment 'the' is special. It is considered unstressed if it is followed by 
another segment (word) starting with a consonant and stressed elsewhere. 

From these principles, more segments can be created by further separations due 
to annotations, which also have consequences on the stress. 

1. If we see the separator the two new segments do not modify their stresses. 
Examples: parenthood (only the first segment is stressed). 

2. If we sec the separator '', the first segment becomes primary-stressed and 
the second unstressed. Examples: gomg, homeless. 

3. If we see the separator the first segment becomes unstressed and the 
second primary-stressed. Examples: aboimd. 

4. If we see the separator 'J, the first segment becomes primary-stressed and 
the second secondary-unstressed. Examples: fatheg,d, northbound, sidecar. 

5. If we sec the separator the first segment becomes secondary-stressed and 
the second primary-unstressed. Examples: although. 

The two first separators may seem to have the same effect, but they are different. 
For instance, the following annotations are not equivalent: outgoing and outjgoing 
(the first only gives primary stress to the first vowel group while the second gives 
primary stress to the two first vowel groups). In any case, the correct annotation 
for this word would be: outgoing (or better, outgoing). 

It is interesting to see examples of unstressed segments: Ireland (land), going 
(ing), engage (en), Birmingham (ham). It's ('s). They've ('ve). The wind (The). 

Now we concentrate on placing the stress over the vowel units in each seg- 
ment. 

Unless annotated otherwise, all primary stressed segments are assumed to 
have the primary stress on the first vowel unit and no other stressed unit. Unless 
annotated otherwise, all secondary stressed segments arc assumed to have the 
secondary stress on the first vowel unit and no other stressed unit. 

Further stressed units in each segment are indicated with the symbol 'a' 
for primary stress and 'a' for secondary stress under the first letter of the vowel 
group. The use of the symbol 'st' removes any other primary stress in the segment 
(only one primary stress per segment, unless all of them are annotated). For 
instance, ago we have that 'a' is unstressed while 'o' is stressed. The annotation 
for the secondary stress 'a' does not remove any primary stress we could have 
previously or any other annotated secondary stress. For instance, 'complicate' 
introduces a secondary stress for the 'a' but does not remove the primary stress 
for the 'o'. That means that we do not need to annotate that like 'complicate'. 
We can have several secondary stresses, such as in 'electroencephalogram' 



22 



Examples of primary stressed vowel units are: plain (ai), assume (u), differ- 
ence (i), science (i), related (a), ago (o), policeman (i), consecutive (first 'e'). 

Examples of secondary stressed units: claustrophobic (primary: o, secondary: 
au), kangaroo (primary: oo, secondary: a), software (primary: o, secondary: a), 
telephone (primary: e, secondary: o), 

When there are at least two syllables (vowel units) on the left of a primary 

stress, and there are no annotated secondary stresss, we assume a secondary 
stress two syllabic (vowel units) on the left. 

Examples of inferred secondary stress unit: 'parabolic' (primary: o, secondary: 
first a), undergo (primary: o, secondary: u), incredulity (primary: u, secondary: 
first i), 'inconsistent' (primary: second i, secondary: first i), 'coexist' (primary: i, 
secondary: o), reconstruct (primary: u, secondary: e), European (primary: sec- 
ond e, secondary: eu), serviette (primary: secon e, secondary: first e). This is 
not applied if there arc less than two syllables on the left (e.g. ago where a is 
not stressed, undo). If we place the secondary stress elsewhere, the rule is bro- 
ken: 'electrolysis' (primary: o, secondary: second e), attendee (primary: second 
e, secondary: first c). Note that if wc want to indicate two secondary stresses, 
the rule is broken and wc need to mark everything, as in 'unambiguous'. 

Finally, any vowel unit before a consonant group which contains the annota- 
tions or " is assumed to have a primary stress, unless the primary stress 
is annotated elsewhere. If there are more than one consonant group with these 
annotation, the rule is only applied for the rightmost. 

Examples of inferred primary stress unit: 'compression' (primary: e), 'transmission' 
(primary: first i), 'illusion'. Note that any primary stress at any other location 
breaks the rule: 'machine', ^yittgenst§in. 

Note that the inferred primary stress and the inferred secondary stress can 
be combined. 

Examples of inferred primary stress unit and inferred secondary stress unit: 
'television' (primary: first i, secondary: first e), 'information' (primary: a, sec- 
ondary: first i), 'definition' (primary: second i, secondary: first e), 'declaration' 
(primary: second a, secondary: e), presentation (primary: a, secondary: first e), 
'communication' (primary: a, secondary: u), 'pronunciation' (primary: a, sec- 
ondary: u), 'initiation' (primary: a, secondary: second i), 'variation' (primary: 
second a, secondary: first a). Note how important it is in these two last exam- 
ples to determine the number of vowel units since the units in one word are 
(i,n,i,t,i,a,ti,o,n', so it is the second 'i' which is two syllables on the left from the 
stressed vowel unit 'a', and the units in the the other word are (v,a,r,i,a,ti,o,n). 

Note that if the secondary stress rule is broken, the primary stress rule can 
be maintained: representation (primary; a, secondary: first e), differentiation 
(primary: a, secondary: first i), purification. 

Finally, these rules can be applied with the simple separator but not with 
the other separators affecting stress. 



23 



3.7 Categorise Vowel Occurrences 

The good thing about this moment of the process is that we can analyse each 
vowel unit independently. The only contextual information we need to know is 
whether or not the vowel unit is stressed, whether it is at the end of a word 
segment, whether it is followed by an 'r' and whether it is followed by a simple 
or complex consonant group (note that we talk about groups and not units for 
consonants, since consonant groups have not been treated yet) and whether the 
vowel is followed or not by another vowel group. 

These situations will allow us to categorise vowel occurrence under three 
dimensions: unstressed/stressed, natural/plain (only for stressed single vowels) 
and rhotic/non-rhotic. 

The categorisation between unstressed and stressed vowel units just inher- 
its the rules for stress processed before. Primary and secondary stressed units are 
considered stressed units. All the other units become unstressed. For instance, in 
'science', we have a stressed unit 'i' and an unstressed unit 'e'. Other examples 
where we show the stressed units: 'nuclear' (u), 'field' (ie), 'suprainguinal' (first 
'u' and first 'i'), 'vehicle' (e), 'Halloween' (first e and a), 'claustrophobic' (o, au), 
'kangaroo' (oo, a), software (o), telephone (e, o). 

The next issue is to distinguish between natural and plain occurrences. This 
only applies to stressed vowel units of one single vowel, i.e. (a, e, i/y, o, u/w), 
either annotated or not. In fact, this applies to positions, rather than to vow- 
els. The rules are basically the same as those which are typically learned when 
spelling English, known as the "double-consonant rule" . 

The first thing to do is to distinguish between single and complex consonant 
groups. Single consonants groups are b, c, d, /, g, h, j, k, I, m, n, p, r, s, t, z (all 
single consonants except x) and the groups #Z and #r (i.e., consonant -|- l/r). 
It is also single when y and w are consonants (e.g. 'yoyo', 'klwT'). The annotated 
double 'r' (r), as in 'pharynx' or 'parabolic', is considered a complex consonant. 
Note that q is typically followed by a semiconsonantic u which makes it complex, 
as well as groups 'gua', 'guo', 'guu', but not 'guc', 'gui' unless the 'u' is annotated 
as 'u'. Some examples of single consonants are 'pod' (p, d), 'ale' (1), 'hole' (h, 1), 
'able' (bl), 'metre' (m, tr), 'macro' (m, cr), 'micro' (m, cr), Apri'l (pr, 1), 'vogue' 
(g), 'opaqiic' (p,q) (in both cases the u is silent). Note that this also applies 
when a consonant has been removed, as in 'Michagl' (ch) (or 'pliiml^er', where 
we need to annotate to avoid the 'plume' sound), since consonant groups are 
formed after the application of '^'. All the other consonant groups are considered 
complex {th as in 'then', ck as in 'check', st as in 'lost', qu as in 'aqua', phth 
as in 'diphthong'). Some examples are 'temple' (mpl), 'checking' (ch, ck, ng), 
trophy (ph), graphic (ph), 'bathe' (th), 'lossy' (ss), ('equal') (qu, which is kw at 
this moment of the process). Note that gh is always considered complex when 
it is not silent 'rgugh'. When it is silent, it is not considered simple or double, 
but has a very especial implication. As we will see, it automatically makes the 
previous vowel (if single) a natural vowel, so we consider 'hi', 'high', 'night' and 
'hgight' equivalent. Consequently, we do not annotate 'tighten'. 
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Other annotations may affect the appraisal of simple or complex consonant 
groups. For instance, semiconsonants y and w are always reckoned as a conso- 
nant, so 'ni' in 'bunion' is now considered a complex consonant). Similarly, 'n' 
in 'lasagna' is a complex group. The consonant groups which include di, de, ti, 
etc., as in procedure or notion, are counted as the consonant letters they have 
inside. For instance, notion is single, while mission is complex. 

From the notion of single/complex consonant group, we can easily derive 
the notions of natural and plain position. A natural position of a stressed single 
vowel is given when: (1) it appears at the end of a word segment, (2) it is followed 
by another vowel unit, (3) it is followed by a single consonant group plus a vowel 
group, or (4) it is followed by a silent 'gh' at any position. Examples: so, me, 
my, high, night (note that 'gh' disappeared in previous steps of the process), toe, 
giant, poet, micro, plate, phony, dinosaur, able, idle, nitclear, vague. All the other 
stressed positions are plain positions. Examples: saffron, little, twenty, aqua, 
temple, checking, graphic (ph), pharynx, tougher. Note that for non-stressed 
vowels, this does not apply, so we have to annotate 'fortnight', 'identity', etc. In 
general, we will use the terms "natural" and "plain" vowels to refer to vowels in 
natural and plain positions respectively. 

Finally, (single or complex) vowel units can be rhotic or non-rhotic^. If 
the vowel is not followed by an 'r' then it is clearly non-rhotic, as in 'fate', 
'fat' or 'hungry'. Unstressed single vowels are considered non-rhotic ('winner', 
'eradicate', 'sugar', 'lantern', 'player'). For the rest of combinations (stressed 
single vowels and stressed/unstressed complex vowel units), the rule is as follows: 
a vowel unit is rhotic if it is followed by a single (non-doubled) 'r' (in the segment) 
('fair', 'laird', 'fairy', 'earring', 'card', 'far', 'norm', 'first', 'her', 'firry', 'praygr', 
'maygr', but not 'player') or double 'rr' not followed by a vowel ('err', 'whirr'). It 
is non-rhotic elsewhere. Explicitly, it is non-rhotic when it is not followed by an 
'r' (mail, now, glade, mode, pad, fitted), is followed by a double rr -t- consonant 
(carry, berry, lorry, hurry) or it is followed by 'r' (miracle). 

Since unstressed vowel units are always non-rhotic, there can be secondary 
stress annotations to force a vowel to become rhotic, as in 'ironic', 'urbane', 
'nowhere' even though these words are not typically transcribed in IPA with the 
secondary stress. 

Note that the pronunciation varies (and it becomes rhotic or not) for RP and 
GA. For instance, the (non-annotated) words 'carry', 'berry', 'stirring', 'lorry', 
'furry', 'hurry') are only pronounced similarly for 'berry', 'stirring' and 'furry', 
with annotations 'berry', 'stirring' and 'furry'. In some American accents, it also 
happens for 'carry'. In other case, both the pronunciation and the annotation 
are different, as in 'hurry' (RP) and 'hurry' (GA), and 'lorry' (RP) and 'lorry'. 
Other cases the same annotation is useful for both RP and GA, even though the 
pronunciation is different. For instance, laureate is annotated as 'laureate', which 
in both RP and GA distinguishes from the sound of 'for'. Table 9 summarises the 



Note that these terms are related but different to their application for dialects, as 
rhotic and non-rhotic accents. 
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possible combinations (the annotations will be better understood in the following 
section). 

3.8 Evaluate Vowel Units 

Evaluating vowel units is the most complex part of the process, given the great 
variety of vowel sounds in English. Even though we have to consider every pos- 
sible case, most cases can simply be handled by a subset of the following set of 
rules and correspondences. 

We will first evaluate vowel units with one vowel (both stressed and un- 
stressed, rhotic and non-rhotic) and then we will evaluate vowel units with two 
vowels (both stressed and unstressed, rhotic and non-rhotic). 

Vowel Units with One Vowel . 

First we will deal with vowel units with only one vowel (i.e, o, e, i, y, o, u, and 
w). We distinguish three cases: unstressed, natural position and plain position. 

Non-annotated unstressed vowel units with only one vowel are pronounced 
as shown in Table 4. Note that the sound is independent of whether it is followed 
by an 'r', because non-annotated unstressed vowels are always non-rhotic. That 
means that 'iridium' makes a [ir-] sound for the first 'i' (not [31-] or [lar-]). This 
is handy in GA, since 'menhir' would make [-ir], while we need to annotate it 
for RP as 'menhir' to make [-lor]. which also works for GA. Note that we have 
to annotate elixir (or elixir), and kaffir to make the sound [-ar]. For soyvenir, it 
also requires an annotation because it is stressed (and then it is rhotic and it 
sounds [-rar]. 

Since 4 introduces some silent sounds for some vowels, it is only at this 
moment of the process that we know how many syllables a word has. It is just 
the number of vowel units which are not silent. 

From here we can define the sounds of stressed and/or annotated vowel units 
with only one vowel as Table 5 and Table 6 show*. The annotations apply to 
a single letter, but most of them can be extended to more than one letter with 
the following rule: if an annotated vowel is followed by a vowel which has been 
crossed out, then we can equivalently write the annotation spreading over both 
vowel. For instance: 

broQch = brooch, bre§,k = break, blogd = blood 
Many of the combinations with one letter are not very frequently used, many 
others (especially those with two letters or in rhotic positions) are never used. 
They are left blank in the tables. 

Vowel Units with Two Vowels . 

Now we address the sounds of vowel units with two vowels (for both stressed 

and unstressed). 

® The appeaxance of the symbol 'j' in the pronunciation of the vowel 'u' depends on 
the previous consonant and on the English accent. This will be explained below. 
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Unit 


IPA 


Examples 


Negative anno- 
tated cases 


Comments 


a 


L J 


above, equal, 
Portiigal, liar, 
arabian, bulwork, 

burglar 


although, artistic, 
message, adequate 




final e 


silent 


male, tie, paleness, 
§ye, they've, metre, 
adequate 


catastrophe 


Note how compound 
words are segmented 
to apply the rule 


e before final d 


silent 


robbed, stuffed 


naked, overfed 




e before final s 


silent 


rules, ladies 


coaches, thesfe 




e (elsewhere) 


[a] 


diet, eleven, barrel, 
ladder, silence, 
Berlin, adequate 


below, eleven, 
chicken, remake, 

eradicate 








indzgo, pretty, 
spaghetti, imply, 
sodium, inspiration 


alumni, digress, 
fakir, menhir, elixSr, 
kaifir 




final o 


[ou] 


cargo, potato, radio, 
turbojet / turbojet, 
euro 




Note how compound 

words can be seg- 
mented to apply the 
rule 


before final s/es 


[oo] 


tangos, potatoes, 
euros 


chaos 




(elsewhere) 




atom, comply, 
saffron, tortoise, 
labor, morality (RP), 

tedious (this is ad- 
dressed by the -ou- 
rule), European 


ion, coerce, 
mortality, morality 
(GA)' 




u 


[a] 


circumstance, 
dictttm, lemwr, 
murmtir, 

sulfphttrous, figure 
(RP), tedious (this 
is addressed by the 
-ou- rule), suspect, 
spatula 


Bantu, iinlqye, 
Portugal, undo, 

uranium, urbane, 
fail tire / failure, 

Y ' 

tenure, figure (GA) / 
figure (GA), jiirassic 





Table 4. Pronunciation of non-annotated unstressed vowel units with only one vowel 
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Class 


Unit 


IPA 


rhotic 


Examples 


Natural 


a, a*, natural 
a 


[ci] 


[ek] 


mote, champogne, hasten, range, gayge, bass, 

Cambridge, ache, membrane , fare, variation 


e, e*, natural 
e 


M 


[iif] 


be, me, scene, key, he's, protein, equal, mere 


l/y, t*/y*, 
natural i/y 


[ai] 


[aiK] 


bite, my, guy, hi, tighten, high, night, mind, 
Fm., identity, arthritis, wi-fi, fire, mobile (RP), 

fortnight 


0, oS, natural 




[ou] 




so, go, tone, don't, most, broQch = brooch, 
protSn, coerce, more, you're (RP), thoygh 


natural u/w 






yQU, mwte, ifniqye, Neptwne, cwre, rwle, frwit, 
yQw're (GA), trpiighout 


Plain 


a, a*, plain a 


[ae] 


[a.] 


cat, taxi, balance, far, have, national, Spanish, are 


< ' 1 • 

e*, plain e 


[ej 


[3JJ 


pet, mess, scent, present, yeah, her, werg 


plain i/y 




[3JJ 


pit, lynx, picking, mirror, miracle, pity, sigve, 
first, whirr 


0, 0*, piam 






pot, bottle, bond, gone, robin, cough, for, sort, 
your (RP) 


v/'w. 

|)1<1111 11^'' U' 






cut. l)7/l')l)lo. cl'/ISti. tQWCll. ('()7/Slll. fy/l'. 7/rl)flllC 


Broad 




[U.J 


In t1 


T>'ji~n('*T' /im SI /"7 r 1 ^ n /■) n 


e, e* 


[ei] 


[3k] 


cafe, break = break, eh, there, weor, dossier 






[ik] 


machine, pizza, skiing, iiniqye, fier, emar, qy§.^, 
p^er, menhir, Zaire 


0, 0* 


L J 


L J 


bought, broad 




[u:! 

L J 


[ukI 

L J 


stZpcr, gurtZ, Bantu, ruhr, ctom, tUJo 


i-Diphthong 


a, S* 


fail 

L J 


[aiKl 

L J 


paella, mSestro, DalS, Diib§i 




fail 


[aiK] 


5yc, h§ight (but hgight is preferred), Either (RP), 
EinstSn, FahrenhSt, 


t, I* 






Not used 


0, oi 


[wa:] 


[wa:j] 


fSie, pat5i§, memSir 


u, u* 


[je] 


[ja-i] 


failure, teniire, ammiinition (GA), Mercury (GA), 
cellular (GA), tubiilar (GA), currlciilum (GA), 
occiipy GA 


u-Diphthong 


a, a* 


[au] 


[auif] 


tau, Laos 


e, e* 


[oi] 


[dik] 


Freudian 


t, t* 






Not used 


0, 0* 


[auj 


[auifj 


now, northbound 


u, w* 






uranium, figure (GA), situation (RP) (it can also 
be annotated as situation (RP), note that in GA it 
is 'sitiiation'), sensual (RP) (in GA it is 'sensiiar), 
ammunition (RP), Mercury (RP), cellular (RP), 
tubular (RP), curricuiilum (RP), occupy RP 



Table 5. Pronunciation of stressed and/or annotated vowel units with one vowel (Part 
1/2) 
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Class 


Unit 


IPA 


rtiotic 


Examples 


Clear 


a, a* 






Maghreb 


~ — 

e, e* 


r^i 
[a3j 


[qjJ 


serge§.nt, clerk [RP] 


I, I* 






mcrmgue, Imgerie 


0, 0* 


r 1 

w 




some, love, does, money, blood = blood, borough 
(RP) 


u, u* 






Not used 


Central 


a 


r ^ 




any, many, agajn, sajd 


e 






Goethe, genre 


t 


Vn 




eA'/l, pencyl. cpus/n. clixyr. kaff yr, terrible, mo/le 









Alobiiis, CTOOtho. word, worst, l^orougli (CVj 


'li 






b'^iry 


lotted 


d 


[A 




orange, message, eabbdge 


e 


[A 




English, pretty, ended, women, Karate 


i 






undistinguishable from i 


6 


[A 




women 


u 


[A 




btisy, minute, lettuce 


Rounded 


a 




bA 


all, call, walk, dlso, water, war, Utah 


e 


[ou] 




seiy, se^n 


I 






Not used 





[u:] 




to, do, shoe, lose, royte 


u 


[ou] 


[oj] 


m^ilve, §,?2bergTne, chg,uffeyr, burgg,ux, sure (RP) 


Opaque 


a, a* 


[o] 




wdtch, what, was, owssie, yogbt, warrant, 
liiurcate, restaurant (GA, formal), restaurant 

(RP, formal) 


— — 

e. e* 






envelope, en royte, genre 


I. ■/* 


L'J 




laigerio (CbV) 


0, 0* 




[uk] 


woman, wolf, good, would, could, tour, your (GA) 


U, M* 


["] 




put, push, Qur^n, Jurassic 



Table 6. Pronunciation of stressed vowel and/or annotated units with one vowel (Part 
2/2) 
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The rule seen above that an annotation which spreads over two vowels is 
equivalent to the annotation of the first vowel and the deletion of the second 
vowel is very useful here. This is especially useful for diphthongs: Se, 5y, 6w, eu, 
etc., and other digraphs oo, which can be used to break the by-default rule. 

In this way there is no need to determine the sounds of all the pairs with all 
the annotations. According to the previous rule, this was done in tables 5 and 
6. It is only necessary to give the information that Table 7 provides^, which is 
the default sound of the English digraphs which make a unit. For completeness 
(this is not necessary), we include some examples of the rest of digraphs which 
do not make up a unit in Table 8. 

The appearance of the symbol [j] in the combination eu/ew in Table 7 follows 
the same pattern as the pronunciation of natural 'u' in Table 5 and Table 6. While 
'fume' and 'few' always perform the sound [ju:] in all accents, and 'June' and 
'jew' always perform the sound [u:] in all the accents, there are differences with 
other phonemes. Our coding assumes [ju:] everywhere and will try to reduce 
that in the postprocess stage, depending on the accent which is desired. 

Finally, table 9 summarises the behaviour of several cases which are rhotic 
and non-rhotic, and their final pronunciation. We do not distinguish between 
North, sort, warm and force, tore, boar, port, but they do sound differently in 
Ireland, Scotland and Wales, and parts of the United States. Precisely, it is not 
a question of being rhotic or not, since all these dialects pronounce the 'r'. The 
difference is more about openness. If we want to highlight the difference we could 
use the following annotation for the second group: force, tore, boar. We do not 
distinguish either between fern, fir and fur, even though they may sound (or used 
to sound) differently in Scotland and parts of Ireland. In any case, no annotation 
would be needed here, since the letter (e, i, u) can be used to guess the specific 
pronunciation in each case. 

3.9 Distinguish Consonant Units in Groups using Digraphs 

Once finished with vowels, we start with consonants now. Even though there are 
many more consonants, the process is much simpler. The first thing we have to 
do with consonants is to recognise the consonant units, which means to recognise 
the digraphs. Given a consonant group such as 'phth' in 'diphthong' we do not 
parse it from left to right, but we follow a priorised order of digraphs. If there is 
a silent annotation, this does not avoid the formation of a digraph. For instance, 
'sgh' allows the formation of the digraph 'sh'. 

Said this, we identify digraphs following this order: 

1. The digraphs 'kn', 'pn', 'gn', 'cn' become [n] at the beginning of a segment. 
Examples: 'know', 'knee', 'gnaw', 'gnome', 'pneumonia' 'forekno-y ledge'. It 

is maintained in 'acknowledge' (although the 'k' sound would be preserved 
because of the preceding 'c'), 'darkness', 'banknote', karyopyknosis 

^ The appearance of the symbol 'j' in eu/ew depends on the previous consonant and 
on the English accent. This will be explained below. 
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Pairs 


IPA 

(Stressed 


IPA (Un- 
stressed) 


Examples 


Annot. Exceptions 


aa 


[a:], [aa] 


[a:], [ai] 


Maastricht, bazaar 


intraaxillary 


ae 

ee 
ea 


[i:], [IK] 


r;.l Frxrl 

[i:J, [iKj 


Daedalus, anaemia, 
conchae, amgebae, al- 
gae, aeolian, chimaera, 

cochleae 


Michagl, cg,esarean, 
g,esthetic, agroplane, 
agrobics, mSestro, Fagroes 


see, seeing, deer, dundee, 
attendee, chimpanzee, rein- 
deer 


reexamine 


[la], [laa] 


mean, seal, fear, rear, 
anneal, fishmeal, laureate. 
Boolean, cereal, nuclear, 
linear, area, cornea, cochlea 


real, idea, yeah, European, 
Gyinca, spreg,d, e^rly, great, 
break, breaking, creation, 
wear. Ocean, Cetacean, 
cocjjleae 


ai/ay 
ei/ey 


[ei], [ek] 


[ei], [ck] 


bay, bait, fair, highway, 
always, Friday, funfair, cock- 
tail 


siiys, DalSi, certain 


[i], [en] 


they, eight, neighbour, 
jQurney, honey, sovgreigri, 
forfeit, heir, their, foreign 


being, beirut, protein, 
atheist, vcjjicle, sSsm9logy, 
height, conceit, perceive 
(although porcoivc is pre- 
ferred), wa.htcwoir. woird 


au/aw 


[oi], [oi] 


[o:], [oj] 


jaw, fawn, automatic, lau- 
reate, centaur, dinosaur, 
Australia 


maiive, gauge, HawSi (note 
how the cluster 'aw' is bro- 
ken), Australia (some ac- 
cents), laureate (a short 
non-rhotic 'o' sound, like in 
'lorry') 


eu/ew 




[(j)u:], 

[(j)H 


new, few, nephew, feudal, 
beauty, euphoric. Euro, Eu- 
rope, eureka 


Freudian, closeup 


oa 
ow 


[ou], [oj] 


[ou], [oi] 


loaf, cocoa, workload, 
coarse, bczoar 


But: oasis, broad, N5ah 


know, owner, show, glow, el- 
bow, window, blown 


now, flower, brown 


ou 


[au], [auif] 


[a], [aj] 


hound, stout, flour, jealous, 
camouflage, labour, colour. 


court, soiil, poiiltry, tour, 
rgugh, CQUsi'n, CQuntry, 
jQurney, cgurage, woiild, 
route 


oi/oy 


[oi], [dik] 


[di], [dik] 


Boy, voice, turquoise, alloy 
(noun), alloy (verb), moira, 

coir 


going, cQy9te, eozoic, 
mem5ir, choir. 


oo 


[u:], [o-b] 


[u:]. [iJUj 


miKju, moor. 


good, blmxl, door 



Table 7. Pronunciation of vowel units with two vowels. 
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Pairs 


Examples 


ao 


kaolin, chaos, Laos, Macao, Maoism, cliaotic, aorta, gaol 


eo 


Cleopatra, Leotard, leopard, peQplc, con, Cheops, Geology, aeolian, 
eozoic, Leonardo, erroneous, Beowrilf, honicopathy 


oe 


goes, potatoes, doer, amoebae, Bogr, does, Boejng, coexist 


ia, ya, ie, 
ye, ii, yi, 

iy, yy, io, 

yo, iu, yu, 
iw, yw 


giant, via, cyan, diagram, Iberian, Belgian, phobia, science, quiet, diet, 
dyer, flyer, society, ambience, berries, married, field, believe, sieve 
(or sieve), friend, tier / tier, dying, crying, bio, violin, skhng, WH, 
HawSi, radii, lion, nation, cesium, Belgium, diuretic (note that the 'i' 
is stressed), handkerchief, serviette 


ua, ue, ui, 
uo, uu, 
uw, uy 


dual, animal, usual, nuisance, fluent, fuel, blue, argiie, byild, gluier 
(from gluey), byy, guy, Guyana 



Table 8. Examples of vowel pairs which are not units. 



2. The digraph 'ph' becomes [f] at any position. Examples: 'trophy', 'philosophy'. 
Some words use a separator to avoid applying this rule: uphill. In other cases, 
an annotation may indicate a change of sound, as in Stephen (note that the 
consonant group is still considered complex). 

3. The digraph 'ch' becomes [tj] at any position. Examples: 'chin', 'rich', 'reach- 
ing', 'Deutgchmark'. 

4. The digraph 'sh' becomes [J] at any position. Examples: 'shine', 'vanish', 
'ashes', 'sghmooze'. Some words may use a separator or annotation to indi- 
cate an alternative pronunciation: 'jcaloushood', 'threshold' 

5. The digraph 'ps' becomes [s] at the beginning of a segment. Examples: 
'psychology', but 'collapse'. Separators can be used to apply the rule: 'prepsychotic' 
or 'prepsychotic'. 

6. The digraph 'rh' becomes [j] at the beginning of a segment. Examples: 
'rhyme', 'rheumatism'. 

7. The digraph 'pt' becomes [t] at the beginning of a segment. Examples: 
'pterodactyl'. 

8. The digraph 'th' becomes [3] before vowel (a,e,i,o,u,y) and [6] in any other 
position. Examples with [S]: 'then', 'that', 'thus', 'the', 'this', 'thy', 'father', 
'bother', 'leg,ther', 'weather'. Examples with [9]: 'three', 'mouth', 'birthday'. 
Some words may use a separator or annotation to indicate an alternative 
pronunciation: 'thing', 'with', 'T^jomas', 'eighth', 'adulthood', 'rhythm' / 
'rhythm'. 

9. The digraph 'gg' becomes [g] elsewhere. Examples: 'bigger'. The other com- 
mon sound must be annotated: exaggerate. Note that, 'suggest' (which is 

generally pronounced as 'suggest') has some peculiar pronunciation in some 

American accents: suggest 
10. The digraph 'ss' becomes [s] elsewhere. Examples: 'mess', 'lossy'. 
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Units 


IPA 


Examples 


Unstressed single vowels 
(equivalent for rhotic and 
non-rhotic) 


[8], [i] 


Color, honors, iridium, irascible, farmer, 
arachnid, irrational, erase, uranium, pat- 
tern, urethra, arithmetic, aroma, erotic 


Unstressed complex vowels 
(rhotic) 


[qj], [lif], [eb], 
[oj], [(j)uK], [aiif] 


funfair, centaur, dinosaur, linear, reindeer, 
Eurasian, bezoar. 


Unstressed complex vowels 
(non-rhotic) 


[ai], [i:], [i], N, 
[o:], [ou], [(j)u:], 
[ai] 


Beiriit, eureka, Stearrhea, urbane (note 
that the second -ur in murmur has a dif- 
ferent sound) 


Stressed single vowels 
(rhotic) 


[en], [ik], [aiif], 
[oj], [(.i)!Jif], [qj], 
[aujf], [3j] 


fare, mere, fire, more, cure, e§,rly, far, her, 
first, for, fur, fierce, there, tier, err, whirr, 
Mary, lorry (GA sounds like lorry), hurry 
(GA), furry, bury (GA), firry, jgurncy, 
star, starring, forest (GA), coral (GA), 
iron, cogur, war, word (word would work 
as well), urine, yQu're (note that this 
case becomes one single segment), tyrant, 
diferring / difcring, ironic (ironic would 
yield it non-rhotic), nowhere (the alterna- 
tive nowhere would make it non-rhotic), 
urbane, ergonomic (note that the 'e' has 
a secondary stress. 


Stressed single vowels (non- 
rhotic) 


[ei], [i:], [ai], [ou], 
[(j)u:], [ae], [e], 
[i], [d], [a], [q:], 
[o:], [au], [wa:] 


marry (RP), marry (some American di- 
alects), berry (RP, GA), herring (RP, 
GA), mirror (RP, GA), lorry (RP), 
hurry (RP), biiry (RP), wSrrant, forest 
(RP), coral (RP), terrible, territory (first 
non-rhotic, second rhotic in GA and 
unstressed in RP), direct, pirate / 
pirate, error, Gary, parabolic, comparison, 
orange, miracle, simp, Sirius, pyromaniac, 
pharynx, hayrick, rerun. 


Stressed double vowels 
(rhotic) 


[ek], [ik], [aiif], 
[oj], [(j)uif], [qj], 
[auif], [31], [waiif] 


Eire, bear, earring, praygr, maygr, Europe, 
Jjeir, their, fiour, moor, deer, dear, wear, 
aural, soar, choir 


Stressed double vowels (non- 
rhotic) 


[ei], [i:], [ai], [ou], 
[(j)u:]. [<i:], [au]. 
[jij, [u:] 


laureate, showroom. 



Table 9. Summary of rhotic and non-rhotic vowels before 'r'. 
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Note also the order. Only the pairs ('ps'-'sh', 'ss'-'sh', 'ps'-'ss', 'pt'-'th', 'gg'- 
'gn') can overlap, as in the interjection 'pshaw", which is first converted into 
'p[J]aw', and then the rule 'ps' cannot be applied and if is finally [p.f]aw. The 
same happens with 'troopship' (alhtthough 'ps' is not at the beginning of a 
segment). 

Once the previous conversions have been made, we only need to analyse letter 
by letter. 

3.10 Evaluate Consonant Units 

In tables 11, 12 and 13 wc show the conversion of the consonant units into IPA 
symbols. Many rules are just straightforward. Many other rules are just scattered 
around but correspond to a simple rule. For instance, the annotations '', " 
typically correspond to the sounds [J], [3], [tj], [d3] (except 'x', which corresponds 
to [kj]), independently of the annotated consonant involved, as shown in table 
10. 



Unit 


IPA 


Examples 


^ (except 'x') 


[J] 


special, ocean, crescendo, conscience, champagne, machme, sugar, sure, 
pension, passion, pressure, censure, tissue, cautious, inertia 




[3] 


lingerie, genre, jet^, illusion, me9,sure, equation, sSzure, abkjjazian, 

Brezhnev 




[tJ] 


cello , Czecli, question, nature, culture 




[d3] 


soldier, graduate (noun) / graduate (verb), procedure, exaggerate, 
Belgian, gorgeous, dungeon, Zhoy 


X 


[kJ] 


luxury 



Table 10. Pronunciation of the annotations 'J, " 



After these tables, we have converted everything into IPA. Before going to 
postprocessing, let us see in table 14 a summary of how c, g, and q are handled. 

3.11 Recompose the Words 

This step is just gluing all the units together to the level of word. 

3.12 Postprocess the IPA result 

Finally, the last step performs some small adjustments in some cases. 

First, we perform the elimination of repeated consonants. For instance, words 
such as 'nutty', 'lorry', 'cure', 'scent' produce [uAtti], [Idjji], and [ssent]. Re- 
peated consonants must be eliminated. 
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Unit 


IPA 


Examples 


Comments 


b 


[b] 


above, bay, rib 




c (before e, i and 

y), c 


[s] 


ice, cyan, scene, accent, flagcid 




c elsewhere, c 


L J 


cat, sick, biscyit, ache, Celtic, 
accent, schema, chord, drachma, 
Micjjagl, cjjaos, scjjool 




c, ci, ce, cy, sc, 


[J] 


special, ocean, crescendo. 




sci, see, scy 




conscience 




c, ci, ce, cy, cz 


[tJ] 


cello , Czech 


digraph 'ch' doesn't reach 








this point. 


ch 


[J] 


champagne, machine 


digraphs 'sh' or 'ch' don't 
reach this point. 


d 


[d] 


dice, red, ladder 




d 




ripped. Apartheid (RP. formal) 




d, di, de, dy, dj 


[d3] 


soldier, graduate (noun) / graduate 








(verb), procedure, Djibouti 




f, gh, V 


[f] 


fit, carf, coygh, tQugh, lieutenant, 








trophy 




f, Ph 


w 


of, Stephen 


digraph 'ph' doesn't reach 








this point. 


g (before e, i and 




gin, bridge, exaggerate, gagl, gin- 


The digraph 'gg' does not 


y)i ff; 9h 9^! gg 




ger, Belgian, gorgeous, dungeon, 
Los Angeles, advantageous (note 
the primary stress is on the last 'a') 


reach this point. 


g elsewhere, g 


[g] 


goat, mug, finger, ghost, target, 
ghetto, Spaghetti, girl 


The digraph 'gg' does not 
reach this point. 







louffh. uffh 




n 

y 


kl 






1 '1 


Tii 
Uj 


1111 liMii IIm (miii:hi tmi]iivp 




j, jj, je, jy 


[d3] 


jet, injure, Fijian 




3 


[h] 


fajita 




3 


[3] 


jete 




h (except final), h 


[h] 


hen, who 




h between vowel 


[] 


ah, eh, uh, oh, yeah, Sarah, Noah, 


This rule is applied at a pre- 


and consonant 




Utah, yeah, shah 


vious stage, so the vowels are 
interpreted as if the 'h' were 
not there. 


h, ch 


[x] 


loch / lough 




k 


[k] 


kit, black, baker 





Table 11. Pronunciation of single consonants (part 1/3) 
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Unit 


IPA 


Examples 


Comments 


I 


[1] 


lake, fall. 




I (non-rhotic), I 
(rhotic) 


[A 


colgnel 




I 

Y 


[j] 


tortilla 




m 


[m] 


mum, lame, hymn 




n (except before 
k, -ca, -co, -cu, - 
cw or -c), n 


[n] 


non, pint, piranha, 
lasagna, incur, 
incorporate, unkind, 
concave 


digraph 'ng' doesn't reach this point. 


ft (before -k, -ca, - 
CO, -cu, -cw or -c), 
n 


fe] 


donkey, ankylosis, Inca, 
triincate 


digraph 'ng' doesn't reach this point. 


p, gh 


[P] 


pet, cup, copper, hiccough 


digraphs 'ph', 'pt' don't reach this point. 


9 


[k] 


queen, Iraq 


digraphs 'qu' doesn't reach this point. 


r, r 


W 


rip, far, Mary, Paris 


the appearance of 'r', 'rr' or 'r' makes the 
position rhotic or non-rhotic and affects 

^"rl£l XTT^ixriila in Tnc»n"ir /nitTOT'OTTi" urc^^rQ ac; c;r*r*"n 
lillt; V UWtJlo ill llldllV viili-Ci tJllli Wd V db 

in a separate table. 


s, s (starting 
's'; 's' between 
consonant and 
vowel; any other 
case when pre- 
ceded or followed 
by a voiceless 
consonant: c, f, h, 
k, p, t, x) 




sin, snake, slave, must, 
also, tense, horse, first, 
it's, pets, rates (note 
that the e is silent here), 
mess, lossy, deaths, yes, 
house, venus, us, lEisagna, 
thesis, whistle, dress, pos- 
sible, serious, seriousness, 
baseball, baseline, bisect 


A voiceless consonant is any of 
(c,f,h,k,p,t,x) or an annotated conso- 
nant or digraph whose sound is voiceless 
(e.g. th in 'baths'). Note: if 's' is alone 
in a segment, it becomes voiceless if the 
previous segment ends with a voiceless 
consonant group (e.g. it's). Note that the 
digraph 'ss' has been processed before 
(and it is voiceless). 


1, ss s (between 
vowels; between 
vowel and voiced 
consonant; after 
vowel or voiced 
group at the end 




easy, houses, lose, posgess 
(first 's' is voiced), wisdom, 
transmission (RP), Thurs- 
day, business, raspberry 
(RP), raspberry "(GA), 
Disney, autism, isn't, 

c 1 u rn ("1 1 r\ m r ciG dliircnrc 

coaches, lads, metres (note 
that the final 'e' is silent, 
so we consider the 'r'), me- 
ters, tales (note that the 
e is silent here), bridges, 
cherries (note that the e 
is silent here), robs, bars, 
fans, Charles, Ann's, he's, 
scis_sors 


A voiced consonant is any of 
(b,d,g,j,l,m,n,r,v,w,y,z) or an anno- 
tated consonant or digraph whose sound is 
voiced (e.g. th in 'clothes'). This sound at 
the end of a segment may finally become 
voiceless if close to another segment which 

otdiljo Wltll a VUiecicao giou^. J?U1 llloLdliCc, 

in 'The cat is g(5od', we have that 'is' 
makes its voiced sound, while in 'The cat 
is strange' it makes a voiceless sound. The 
annotation is not affected. Note: if 's' is 
alone if a segment, it becomes voiced if 
the previous segment ends with a vowel or 
a voiced consonant group (e.g.: He's). 


s, si, se, sy 


[3] 


illusion, measure 




s, si, se, sy, ss, 
ssi, sse, ssy 


[J] 


sugar, sure, pens^ion, 
passion, pressure, censure, 
tissue 





Table 12. Pronunciation of single consonants (part 2/3) 
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Unit 


IPA 


Examples 


Comments 


t 


[t] 


tame, cat, site, fatty, 
Anthony 


Digraph 'th' doesn't reach this 
point. 


t 


^] 


computer, pretty, gotta 
(some accents) 


This is not used in formal tran- 
scription. . In fact, the rule is 
that this happens for American 
English when 't' is between vow- 
els. The same could be defined 
for d, which sounds similarly in 
some accents for some words 


t, ti, te, ty 


in 


cautious, inertia 




t, U, te, ty 


[tj] 


question, nature, culture 




t, ti, te, ty 


hi 

LoJ 


equation 




th 


[te] 


'eighth' 


digraph 'th' doesn't reach this 
point. 






VCI1II5 odVCj odV 




y 


[f] 


Brezhnev, IStmotly 




w, w,^ 


[w] 


win, twin,^6nce 


Note that w in vowels have been 

piUCcoocU. picVlUUoiy, oUCll do Hi 


w 


[VJ 


lOLLwenei ^^aimuugii 11 lo 

ctloU oilllUiy pi UliU LlliCCLl clb 
HJ L L W CiiCl (, V V 1 1 1 &eiiO tCili 


INULfc: LlidL W' ill VUVVCih lldVc Uccli 
jJi UCcbbtJLl JJi C V lU Llbl_y J bLlCli do 111 


w 


Twl or rAvl . 
depending on 
the accent 


when, why 




wh 


M 


when, why 


The annotator has indicated 
that she wants the sound to be 
pronounced as [hw] 


X (before stressed 


LcJ J 


example, exaggerate 




vowel), X 








X (any other posi- 


[ksl 

L J 


extra, extreme, galaxy, taxi 




tion), X 








X 


M 


xyloplioiin. bureaux 




X 


[kj] 


luxury 




y 


[j] 


yale, yes 


Note that y in vowels have been 
processed previously, such as in 
'ray'. 


z 




crazy, zoo, buzz, Jzar 




z 


[ts] 


pTzga, Alzhgimer 




z 




glitzy 




z, zi, ze, zy, zh 


[3] 


seizure, abkjjazian. 








Brezhney 




z, zh 


[d3] 


Zhoy 





Table 13. Pronunciation of single consonants (part 3/3) 
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Unit 


IPA 


Examples 


Negative cases 


(//;. + aiusoiiinif. q + 


M 


li'iui. (Juraii 




qu + {a,e,'i,o,u,y} 


[kw] 


queen, quality, quick 




gu + {a,o,u\ 




bijingual, guano 


contigiious, gyard 


gu + {e,i,y} 


[g] 


guess, guilty, guy 


guard, distinguish 


g + {a,o,u\ or consonant 


[g] 


gas, gun, green, bag 


gagl 


fi + {<'-/i,y} 




gin. gene 


girl 


c + {a.o.ii} or coiisoiiaiit 


M 


can, cnlt, crazy, l)lack, crystal 




c + {e,i,y} 




centre, cyan 


celt 



Table 14. Summary of sounds for q, g, c + vowel 



Second, the combination of some consonants can produce pairs which cannot 
be properly pronounced. The clearest case is the ending in -tre, as in 'metre', 
which produces —>■ [mi:tj], but should be [mi:taj], by introducing the schwa (an 
unstressed sound). Does this mean that 'metre' should be annotated as met're? 
Does this happen for other consonant combinations? Can we distinguish the 
right positions? (Wc do not have the problem for 'central' or 'centring'). 

If we first analyse the positions, the problem appears to be when '#r' is not 
followed by a vowel. But consider the word 'fibre-optics'. If we consider the two 
segments together and annotate it like this 'fibrcoptics', then the pronunciation 
is not correct. So the rule is confirmed. The problem appears when '^r' is not 
followed by a vowel inside the segment. 

Regarding the question of the combinations, it is very difficult (e.g. [t9] works, 
but [9t] does not, while both [ts] works and [st] do, and of course [bd], [ft], etc.). 
We could analyse the approximately 400 combinations of pair of consonants to 
sec whether they articulate well together, where many of them never happen. 
Instead, the idea is then to derive a very simple rule to cover the most frequent 
cases, and then annotate any possible exception. This is the philosophy taken 
for the process. 

We distinguish two common situations: 

— [-1], [-m], [-n]. Examples: able, girl, little, rhythm, autism, isn't, re§,lm, farm, 
fern. There is no need to insert a schwa, even though in some cases it seems 
that a schwa might be convenient (rhythm rhythm' , autism autism, 
isn't isn't). 

— [-i]. Examples: fibre, ogre, metre. We need to insert a schwa. 

So the rule will be to introduce a schwa between a consonant + 'r'/'rs' at the end 
of a segment, as in 'metre' or 'metres', which initially produces [miitr] but 
finally it is converted into [mi:tor]. Note that this does not conflict with words 
such as 'genre', because the 'r' is not at the endof the semgent. In some other 
cases where it is also necessary, we will annotate it. For instance: 'ibn' . 
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The opposite problem is whether we want to reduce idol, open, icon, tighten, 
fasjen, cottons, isn't, etc. In this case, if the annotator wants to do it because in 
her accent 'idol' and 'idle' are the same, it is just enough to use the annotation 
" for that (e.g. idgl, instead of idol). 

As a summary, we will only automatically deal with the 'r' (which does not 
frequently happen with American spelling except for 'acre' and some other few 
words), and the rest will be left to the annotator (as in autism, or opgn ). 

Now, wc deal with different English varieties. The following conversions are 
performed for GA: 

[k] [j] (e.g. 'fire', 'beer') 
[3:] ^ [3] (e.g. 'fur') 
[d] [a] (e.g. 'pot', 'bother') 
[a:] [a] (e.g. 'father') (note that wc annotate 'fast' and not 'fast' for GA) 
[i:] [i] (e.g. 'feed') (not all phonologists advocate for this phoneme change) 
[o:] [0] (e.g. 'feed') (not all phonologists advocate for this phoneme change) 
[u:] — > [u] (e.g. 'moon') (not all phonologists advocate for this phoneme change) 

The following conversions are performed for RP: 

[j] [j] (e.g. 'for', if followed by a non-silent vowel, such as 'for us', 'carry' or 

'rat') 

[j] — > [] (e.g. 'for', if not followed by a non-silent vowel, such as 'for them', 

'forth' or 'furs' 

[k] [aj] (e.g. 'dear', if followed by a non-silent vowel, such as 'dear Ann', 

'firing' or 'dearest') 

[h] — > [a] (e.g. 'dear', if not followed by a non-silent vowel, such as 'dear John' 

or 'moors') 
[ou] [au] (e.g. 'tone') 

Other conversion rulescts can be established for other dialects (Australia, Scot- 
land, etc.) Note that the process only deals with these mergers at the end of the 
process. This is useful to distinguish between 'flower' and 'flour', since the first 
is [flauaj] in GA and [flaua] in RP, while the second is [flaui] in GA and [flaua] 
in RP. 

Another issue is the pronunciation of the diphthong [(j)ui], which is some- 
times performed as [ju:] and sometimes as [u:]. In tables 5, 6 and 7 we denoted 
by [(j)u:] the correspondence for 'eu/ew' and natural 'u'. While 'fume' and 'few' 
always perform the sound [ju:] in all accents, and 'june' and 'jew' always per- 
form the sound [u:] in all the accents, there are differences with other phonemes. 
In this final part of the process we eliminate the symbol (j) depending on the 
accent. For instance, if we choose RP, then it is only reduced in the following 
cases: 

[rju:] [ru:] 
[lju:] ^ [lu:] 
[Jju:] - [J-u:] 
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[tjju:] ^ [tju:] 
[sju:] feu:] 
[dsju:] ^ [dsu:] 

In case we want to force [u:] for a word we can use u, gu, gw. In case we want 
to force [ju:] for a word we can use^u,^eu,^ew. 

Finally, the elimination of repeated consonants is performed again. 

With all this, we have the IPA pronunciation for each word. Note that the 
IPA produced will not be exactly a standard IPA because of the placement of 
stress. We place stress on vowels, while IPA place stress at the start of syllables. 
Placing the stress with the IPA notation will not be performed, since we have 
not considered the notion of syllable, only vowel and consonant units. 

4 Annotation Rules 

After the previous section, we know how an annotated word must be interpreted 
(i.e. pronounced) in a precise way. Consequently, for any word a reader may have 
doubts about its pronunciation, the previous rules can be applied and the true 
pronunciation is obtained. 

As discussed in the introduction, in order to read annotated English, some- 
one (or something) must have annotated the text previously. As seen in Figure 
°reffig:inteannot, we require two inputs for the introduction: the English word 
and its pronunciation, in order to produce an output: the annotated word. We 
can distinguish two issues here. First, going from the English word + pronuncia- 
tion to the annotated word can be easy in some cases, but cumbersome in others. 
In fact, the problem is like finding a transformation, using deletions, insertions 
and substitutions, that makes an alignment between the original English word 
and its pronunciation. The problem is easier than the general computational 
problem since we only have three possible insertions (", '''). For instance, the 
word 'house' and the pronunciation [haus] easily suggest that we find a possible 
sound for 'h' which matches the IPA [h-] , we also find a possible sound for 's' (s) 
which matches the IPA [-s], and we have that the digraph 'ou' can be converted 
into [-au-]. Finally, a deletion of the 'e' makes the correspondence perfect. This 
gives us the correct annotation 'housg'. Following the rules seen in the previous 
section, we can drop the last annotation and produce 'house'. 

The previous process is relatively simple for a human being if she has a good 
knowledge of the interpretation rules. A more difficult problem is how to make 
that automatically, i.e., by a computer program. This would require a coding of 
the rules, several transformation / alignment techniques and some off-the-shelf 
algorithms, such as dynamic programming. This is a solvable technical problem 
if we have the word and the pronunciation^'^. Given the small size of the words 
(each word is analysed independently) , there are no relevant computational cost 

^° The pronunciation of every English word (including plurals and inflected verbs) can 
be obtained from several corpora and webpages. 
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issues. The annotation process can be implemented in such a way that it anno- 
tates very fast on any personal computer. The only main concern to make this 
process automatic is the distinction of homographs which are not homophones 
(wind, live, use, read, lead, etc.). Some disambiguation techniques from compu- 
tational linguistics would be required, which can resolve a high proportion of 
cases. The most difficult cases might be left for human supervision (perhaps one 
in each ten thousand words). 

In this section we are not addressing the automation of the annotation pro- 
cess. We leave the analysis of the automation out of this paper. In this section, 
we discuss on a related problem we have to solve first before any manual or auto- 
mated annotation process can b(!gin. The problem is that, given an English word 
and a pronunciation, several alternative (and correct) annotations are possible. 
For instance, we can annotate 'head', 'head' and 'head'. 

An option would be to allow all of them (all of them can be interpreted and 
give the same, correct, result) and choose them randomly or by a lexicographi- 
cal order. However, having different options depending on the annotator would 
cause confusion, since the same word with the same pronunciation would look 
like differently depending on the occasion. Consequently, we need to settle a 
'standard' choice, which gives preference for one annotation over the rest. This 
is also shown at the bottom of Figure 1. 

4.1 Annotation Preference Criteria for a Standard Annotation 

In order to define a standard to choose one annotation over the rest, we can 
consider the following kinds of criteria: 

— Economy. The fewer annotations a word has the better. So, we should prefer 
'head' over 'head' and 'head', and we should also prefer 'house' over 'house'. 
In fact, the rules in the previous section have been devised to avoid annota- 
tions when the (part of the) word follows the rule, so it would be stupid to 
include an annotatation which is not required. 

— Simplicity. Some annotations are simpler than others. For instance, 'werg' is 
simpler than 'were'. 

— Explicitncss. Some annotations are more explicit than others. For instance, 
'were' is more explicit than 'werg'. 

— Frequency. Some annotations rely on more frequent symbols than others. 
For instance, 'head' is preferrable over 'head'. 

— Etymology. Some annotations are more consistent with the etymology or 
word formation rules. For instance, 'ago' is preferrable over 'ago'. 

The problem is that the previous criteria are conflicting. For some words, apply- 
ing one criterion implies going against some other criteria. There are two options 
here: (1) to devise a priority of criteria, (2) to give cost weights to each criteria, 
compute the costs and choose the annotation with the lowest cost. Note that the 
second option subsumes the first if wc define high differences between the costs. 
In any case, both options are fine whenever they make a choice and allow for a 
standard to be defined. 
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In what follows we propose a cost weighting approach. We exclude etymology 
from the criteria, since this would require additional information and would make 
the process difficult to automate. In fact, we give high costs to separators, since 
we do not want to see things such as 'ago', 'thirteen' or 'microphone', because 
they are not etymological. So, the costs we propose are shown in Table 15. 



Type of annotation 


Cost 


Comments 


Stress ('a' or 'a') 


30 




Silent ('g,') 


31 




Non-rhotic annotation ('cor') 


10 




Broad, idipnthong or udipnthong annota- 


34 




tion affecting one vowel 






Broad, idiphthong or udiphthong annota- 


37 




tion affecting two vowels 






Frequent Annotations affecting one vowel 


42 




(plain, natural) 






Double Frequent Annotations (plain, nat- 


53 




ural) 






Non-Frequent Annotations (the rest) 


44 




Double or Triple Non-Fi-equent Annota- 


55 




tions (the rest) 






Separation not affecting stress ('") not leav- 


65 




ing a eoiisouaiit gr(Mip alone 






Simple separation affecting stress (", ") 


66 




not leaving a consonant group alone 






Separation affecting stress (", 'J) 


92 




Introductions (", " or with or without 


200 




annotation, as in metre) 







Table 15. Annotation Costs 



For each annotation, we also reckon the position cost, which is an additional 
number we sums up the position number where all the annotations have taken 
place. This position is negative for deletions ('a') and positive for any other 
annotation. For instance, 'extreme' has position cost -1-1 -1-5 = -1-6, and 'je9,lous' 
has position cost -1-2 — 3 -I- 7 = -1-6. The position cost is only used to solve ties 
when applying the costs in Table 15. Generally, this means that with annotations 
with the same cost, we prefer the annotation to be leftmost unless it is a deletion, 
where we prefer it to be rightmost. For instance, 'dgor' has position cost —2, 
while 'dogr' has position cost —3 (hence 'doQr' is prcferrablc). 

We apply the costs in tables 16, 17 and 18 for some alternative annotations 
in the examples shown 15. 
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Type of annotation 


Comments 


von C^]) vnii ('44+31') von ('31+31") 

y vJ Ll \rj A. J ^ y KJ Li y tit: | 'J _ y W LI V'-'-'- 1 ^ / 




ilClflllCl (OOj. ilQiflllCl 








doQr (31), dQor (31), door (37), door (53) 


Position costs are —2 and —3 respectively, 
iieiice cluyi lo pieiciiauie. 


rt^llliLJ LcLLtJU. \^OUn^4:4: J ^ XT-liliULdLcU. 4:4: ^ 








gye (37), gyg (31+31), eye (34+31) 


eye (31) is not valid since it makes 'yee' 


-f--,.-. (0'7\ +„rn /'Q1_l_/l/1^ 

two t^O ^^oi+44j 


two (44) is invalid, since there is no semi- 

consonantic 'w sound 


1('n-r'l ( \'>\ f'Ci'i'l lin-r^l ffifil 

It, \ f.,i l±_). it.. \ f.,i lU'J), it.. \ f.,i lUU) 




DL y^aiS. yO 1 1 5 Ui l_<cLJ\. 1 04:n^0 J. 1 5 Ui Ucliv I O _L n^4:Zr f 




yOl J ^ Ui ccLivlll^ 1 O ( ) , UltigjiS-llig ^^O'in^Ol I 




Ke;^;- (dij, key [4z-\-6i.), key (5dj, kgy (^i + d4j 




couj^a (^44+oij, coyia (^44H-oiH-oij, CQtycl 

l^Oi-|-44-|-Oi ) 




tour (55), tQur (31 + 34), toyr (44 + 31) 




Dyiia t^oij, DUjiu (^44H-oij 




cousiii ^^oo-r44 ^ , cyusiii (^oi-r4z-|-44 J , coysin 

(44+31+44), cousin (31+42+44) 




■KiivA;^ (AA-\-']r\\ Kiir^r (AAj-fi^^ 

Dury y^^-\- L\j ) Dury i^'i^^-i-DOj 




cnngy l^ooj, cimgy (^44-|-jij 




bind (65), bind (42) 




nave (oi), nave (bo), nave (4zj 




microphone (30), microphone (42), microphone 


Note that some of these annotations are 


(ou i- ouj, micropnpne (y-^j, micrc^none 


not equivalent (secondary stress or not) 


comp (4z+olJ, comp (oo-t-olj 




siaecar (y^^j, siaecar (4z-t-oi-t-oUj 


-— -r— — — - — — p- — ^ — — ■ — - 

sidecar (42+31+34) does not make the a 

rhotic and it is invalid 


received (44+30+31), received (44+53+30) 




alumni (30+65) 


alumni (30+92) is not valid, since the 'i' is 

not stressed. 


undergo (30), undergo (92), undergo (92) 




nightmare (30) 


nightmare (42) is not valid because the 'a' 
should be rhotic. 


Birmingham (65+31) 


Birmingham (65) does not work. 


ago (30). ago (66). ago (66) 




fa,thca.(lc(l 


Tlic separation is needed here 


telephone (42+44+30), telephone (42+44+92). 




belief (44+37+30), belief (44+31+30+42) 





Table 16. Examples of Annotation Costs (1/3) 
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Type of annotation 


Comments 


your (55), your (31+42), your (44+34) 


GA pronunciation 


peQple (31), people (53) 




through (31), through (44+31), thrQugh (31+34) 




youth (31+34), youth (44+31) 




foyr (31), four (37) 




hgight (31), hgight (44) 




outgoing (30+30+65), outgoing (92+65), 
outgoing (92+65) , outgoing (92+65) 




evening (65), evening (66), evening (42+31) 




homeless (65), homeless (66), homeless (42+31) 




boredom (31), boredom (65), boredom (66) 




something (44+31+44), something (44+65+44), 
somgthing (44+31+65) 




anything (44+44), anything (44+65+44), 

anything (44+65) 




somewhere (44+31+34), somewhere (44+65+34) 




nowhere (65+34) 


nowhere (42+34) does not work, because 
'wh' is not at the beginning of a segment 
or after a consonant 


aside (66), aside (44+30) 




bisect (42+66), bisect (42+44+30) 




awhile (66) 


awhile (30) doesn't work ('aw' digraph). 


ahhough (44+30+31), although (44+66+31) 




nTratp ('49-1-10") niratp 


JJli. Clljtj kJ-VJC-O llUlj WKJLlS. Ill J-lil ^ OlllL/C; IjllC 1 

io IlUt lllULlC ilclc. 


(30+65+44+42+30) 




fipr (V7\ tier 49 


■j-ip-t* ic "\raliH "F(~»T A finrl "RP Txrliilp I'lpv nnlir 

LilCil lO VCLllV-i IWl vJX -Ty. dllvJ. XLX ^ WllllC LlCl vJlllV 

for RP. 


father (34), father (44) 


father is only useful for some American ac- 
cents merging father and bother. 


Denmark (30), Denmark (34), Denmark (34+31) 


'Denmark' does not work in RP, since 
the 'a' needs stress to become rhotic. 
'Denmark' does not work for GA 


direct (42+30), direct (42+66) 


'direct' does not work, since 'i' is not 


stressed. 


earring (31), earring (65) 


The double 'r' must be broken to ensure it 
is rhotic in RP 


target (44+44), target (65+44) 




house (44), house (65) 




equal (42), ecjual (65) 




macro (42), macro (65) 




turbojet (42+30), turbojet (92) 




incur (66), incur (44+30) 




closeup (44+31+30), closeup (44+92) 




scissors (55), scisgors (42+31) 





Table 17. Examples of Annotation Costs (2/3) 
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Type of annotation 


Comments 


ever (42), ever (65), evgr (31+200) 


The last option requires the introduction 
of schwa 


evil (44), evil (42+31) 




Qevii ^_<J-Lj5 aevii \^^z-\-^^j 




rhythm (44), rhythim (200) 




elixir (44+42+44), elixir (44+42+31+200) 




sigvg (31+31), sieve (42+31) 




beauty (31), beauty (31+31) 




failure (44), failure (200) 




eurosccptic (42+44+30), eurosceptic (92+44), 
eurosceptic (42+92), 




thirteen (30+30), thirteen (92), thirteen (92) 




nineteen (92), nineteen (30+42+31+30) 




foreknowledge (30+31+31+30+44+31+44), 
forekni-^ledge (92+42+31+44) 




Australo_pithecus (92+44+44), Australopithecus 

(30+42+30+44+44) 




framework (92+44), framework (65+30+44), 
framgwork (42+31+30+44) 




thrQugh9ut (31+30), throughput (31+34+30), 

throughout (44+31+30), throughout (44+31+66) 




arrangement (42+30+44+31), arrangement 
(42+30+65) 





Table 18. Examples of Annotation Costs (3/3) 
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5 Some Examples 



In this section wc include several examples taken from different areas, in order 
to see how annotations look like in real text and also to observe the proportion 
of annotation in general texts vs. formal texts. 

5.1 200 Most Common Words 

1 50: 

the of to and a in is it ygu that he was for on arg with as I his they be at 
one havg this from or had by hot word but what some we can out other werg all 
there when up use (verb, the noun would be 'use') yoyr how said an each she 

51 100: 

which do their time if will way about many then them write would like so 
these her long make thing see him two has look more day coiild go come did 
number sound no most peQple my over know water than call first '^ho may down 
side been now ffnd 

101-150: 

any new work part take get place made livg where after back little only round 
man year came show evgry good me givg our under name very through just form 
sentence great think say help low line differ turn cause much mean before move 
right boy old too same tell 

151-200: 

does set three want air well also play small end put home read (the past 
participle is 're^d') hand port large spell add even land here must big high such 
follow act why ask men change went light kind off need house picture try us 
agajn animal point mother world near by.ild self e§,rth father 

5.2 Miscellanea 

Numeral Numbers: 

^one two three foiir five six seven eight nine ten eleven twelve thirteen fourteen 
fifteen sixteen seventeen, eighteen nineteen twenty thirty forty fifty sixty seventy 
eighty ninety hundred thousand million 

Ordinal Numbers: 

first second third foy.rth fifth sixth seventh eighth ninth tenth eleventh twelfth 
thirteenth ... twentieth thirtieth hundredth thousandth millionth 

Days of the week: 

Monday Tiiesday We^ngsday Thursday Friday Saturday Sunday 
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Colours: 

Black white red blue green yellow pink violet gray /grey purple orange brown 
cyan magenta turquoise. 

Solar system: 

Sun Moon Egrth Mercury Venus Mars Jupiter Saturn Uranus Pluto 
Places: 

Europe America Africa Asia Oceania Antarctica United States of America 
England Britain Wales Scotland Ireland Australia New Zealand India Canada 
Spain France Germany Denmark Holland Italy China Brazil Japan Mexico Russia 
Pakistan London New York Birmingham Edinburgh Paris Rome Berlin Barcelona 

5.3 A Literary Work. A Tale of Two Cities 

A Tale of two Cities by Charles Dickens 

Book the First, Chapter I: The Period 

It was the best of times, it was the worst of times, it was the age of wisdom, 
it was the age of foolishness, it was the epocjj of belief, it was the epocjj of 
incredulity, it was the season of Light, it was the season of Darkness, it was the 
spring of hope, it was the winter of despair, we had everything before us, we had 
nothing before us, we werg all gomg direct to He§,ven, we werg all going direct 
the other way - in short, the period was so far like the present period, that some 
of its noisiest authorities insisted on its being received, for good or for evil, in 
the siiperlative degree of comparison only. 

There was a king with a large jaw and a queen with a plain face, on the 
throne of England; there were a king with a large jaw and a queen with a fair 
face, on the throne of France. In both cguntries it was clearer than crystal to 
the lords of the State preserves of loaves and fishes, that things in general werg 
settled for ever. 

It was the year of Our Lord ^ne thousand seven hundred and seventy-five. 
Spiritual revelations werg conceded to England at that favoured period, as at 
this. Mrs. Southcott had recently attained her fivc-and-twcnticth blessed birth- 
day, of -^hom a prophetic private in the Life Gyards had heralded the sublime 
appearance by announcing that arrangement (42+30+65) werg made for the 
swallowing up of London and Westminster. Even the Cock-lanc ghost had been 
laid only a round dozen of years, after rapping out its messages, as the spirits 
of this very year last past (supernaturally deficient in originality) rapped out 
theirs. Merc messages in the earthly order of events had lately come to the 
English Crown and Pegple, from a congress of British subjects in America: 
which, strange to relate, havg proved more important to the human race than 
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any communications yet received thrgugh any of the chickens of the Cock-lane 
brood. France, less favoured on the whole as to matters spiritual than her sister 
of the shield and trident, rolled with exceeding smoothness down hill, making 
paper money and spending it. Under the guidance of her Christian pastors, she 
entertained herself, besides, with such hiamane achievements as sentencing a 
yguth to havg his hands cut off, his tongue torn out with pincers, and his body 
burned alive, because he had not kneeled down in the rain to do honour to a 
dirty procession of monks which passed within his view, at a distance of some 
fifty or sixty yards. It is likely engugh that, rooted in the woods of France and 
Norway, there werg growing trees, when that sufferer was piit to deQ,th, already 
marked by the Woodman, Fate, to come down and be sawn into boards, to 
make a certain movable framework with a sack and a knife in it, terrible in his- 
tory. It is likely engugh that in the rgugh outhouses of some tillers of the he§,vy 
lands adjacent to Paris, there werg sheltered from the weather that very day, 
rude carts, bespattered with rustic mire, snuffed about by pigs, and roosted in by 
poultry, which the Farmer, De9,th, had already set apart to be his tumbrils of the 
Revolution. But that Woodman and that Farmer, thoygh they work unceasingly, 
work silently, and no jjnc heard them as they went about with muffled tread: the 
rather, forasmuch as to entertain any suspicion that they werg awake, was to be 
atheistical and traitorous. 



5.4 Songs, Poems and Sayings 

Rock-a-bye Baby : 

Rock-a-bye baby, on the trectop. 
When the wind blows, the cradle will rock. 
When the bough breaks, the cradle will fall. 
And down will come baby, cradle and all. 



All the pretty horses : 
Hush-a-bye, don't ygu cry. 
Go to sleepy ygu little baby. 
When ygu wake, ygu shall havg cake, 
And all the pretty little horses. 
Blacks and bays, dapples and greys. 
Go to sleepy ygu little baby, 
Hush-a-bye, don't ygu cry. 
Go to sleepy little baby. 
Hush-a-bye, don't ygu cry. 
Go to sleepy little baby. 
When ygu wake, ygu shall havg. 
All the pretty horses. 
Way down yonder, down in the meg,dow, 
There's a poor^^ wee little lamt)y. 

" 'poor' in RP. 
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The bees and the butterflies pickin' at its Syes, 

The poor^^ wee thing cried for her mammy. 

Hush-a-bye, don't ygu cry, 

Go to sleepy httle baby. 

When ygu wake, ygu shall havg cake. 

And all the pretty little horses. 

5.5 A Formal Text. Human Rights 

Universal Declaration of Human Rights 

Adopted and proclaimed by General Assembly resolution 217 A (III) of 10 
December 1948 On December 10, 1948 the General Assembly of the United 
Nations adopted and proclaimed the Universal Declaration of Human Rights, 
the fiill text of which appears in the following pages. Following this histOTic 
act the Assembly called upon all Member CQuntries to publicize the text of the 
Declaration and to cause it to be disseminated, displayed, reQ,d and expounded 
principally in schools and other educational institutions, without distinction 
based on the political status of cguntries or territories." 

5.6 A Scientific Text 

^First section of the article 'Australopithecus' (retrieved from the English Wikipedia 
on August 25th, 2010).^ 

Australopithecus (Latin ^australis^ "sQuthern", Greek iTLOrjKoa j)ithekos_^ "ape") 
is a genus of hominids that arg now extinct. From the evidence gathered by 
palagontologists and archaeologists, it appears that the Australopithecus genus 
evolved in eastern Africa around 4 million years ago before spreg,ding thrgughout 
the continent and eventually becoming extinct 2 million years ago. 

6 Conclusions 

This paper has presented ''Annotated English^, a proposal of diacritical anno- 
tations which converts English into a language whose pronunciation can be un- 
ambiguously inferred from its spelling (plus annotations). The novel principle, 
which distinguishes this from any spelling reform, is that we set as an unbreak- 
able principle that no single letter of English can be modified. Since English has 
virtually no diacritics (only in some loanwords, such as resume, creme, pifiata, 
etc., whose diacritics would be eliminated prior to the annotation process) we can 
use diacritics to annotate the pronunciation. ^Annotated English'' differs from a 
simple pronunciation without respelling (partially existing in some old spelling 
books, dictionaries and specialised books) in the fact that we introduce a set 
of rules which incorporate most of the usual pronunciation rules in the English 
language. In this way we not only minimise the number of annotations, but they 

'poor' in RP. 



49 



appear when the reader has the perception that the part of the word which is 
annotated is an exception to a rule. 

The proposal includes a set of rules and many annotation symbols. Rules 
include how vowels, vowel digraphs, semiconsonants, consonants, and consonant 
digraphs must be pronounced. A set of about 20 annotation symbols has been 
necessary, especially because there are some characters in English (most espe- 
cially vowels) which can have about 10 different sounds (without considering 
rhotic sounds). A first impression might be that the set of rules and the set of 
annotations are large and complex. We have to make some comments on this. 

— Rules: most rules wc have introduced here (double-consonant rule to distin- 
guish between plain and natural vowels, vowel digraphs, consonant pronunci- 
ation, final e, final ed, rhotic sounds, etc.) are just the same (or very similar) 
rules every speaker of English knows. This document just places everything 
together. For an average Ejnglish speaker, a short leaflet or schematic annota- 
tion table can be constructed as a mnemonic for beginners to the annotation 
system. 

— Symbols: we do not need to know all the symbols precisely in order to guess 
the right sound. For instance, if we see 'word', we can infer that it is not 
like 'lord', even ignoring what the sound of '6' is. In fact, when a new word 
is read, the doubt is typically between two possibilities: the regular and the 
irregular way. The annotation just sorts this out. 

Given an annotated text, a reader can take advantage from annotations at several 
degrees. They can be ignored, and the text can be read as any other plain text. It 
can be used to distinguish regular cases from exceptions, or to place the stress in 
some words. Many annotations are intuitive or just obvious, such as the deletion 
symbol, as in 'debt', so even the first time an annotated text is used, there arc 
some basic understanding. More advanced (or interested) readers will soon make 
the small effort to take a look at the rules or to infer them by reading. In the 
end, if wc have been able to infer thousands of pronunciation exceptions in the 
English language, we can also manage with a set of rules. 

Having said this, there is of course an interesting discussion about the trade- 
ofi^ between number of rules and percentage of annotations. One of the criteria 
we have followed is that the rules should be tuned to favour the most common 
words. That is the reason-why we have preferred a voiced 'th' and 's' in some 
situations (instead of a voiceless one as the by-default case for every position). 
Even though we need two rules for 'th' and 's', it saves many annotations in 
frequent words. In the appendix we discuss some other rules we did not finally 
include, because wc thought (and think) that the set of rules must be simple 
and easy to learn by average people. Nonetheless, the exclusion or inclusion of 
rules can be done after a proper frequency analysis (this is what we have done 
to a greater or lesser degree for the introduction of our rules). In any case, the 
analysis of any old or new rule must take into account the frequency of words^"^, 
and very especially the most frequent words. According to [1], the first 25 most 

A very good resource is http://en.wikti0nary.0rg/wiki/Wikti0nary:Prequency_lists 
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common words make up about one-third of all printed material in English. The 
first 100 make up about one-half of all written material, and the first 300 make 
up about sixty-five percent of all written material in English. 

In Table 19 and Table 20 we show some preliminary statistics (computed 
from the first five paragraphs for the entry 'bed' in wikipedia in different lan- 
guages) where we see that the proportion of annotations is high. Nevertheless 
there are some European languages using the Roman alphabet with a higher 
annotation/diacritics ratio. This means two things. First, English spelling is not 
so irregular, provided a consistent set of rules is defined for it. Second, we should 
not obsess about reducing the annotation ratios by introducing many more rules, 
since the values are comparable to other languages which use diacritic-s and their 
speakers are relatively happy with them. Additionally, the annotation process 
will be automatic, so we should not care about writing in ''Annotated English"'. 



Indicator 


Value 


Annotations per Letter 


1/12 


Annotations per Word 


1/3 


Annotated Word per Word 


1/4 



Table 19. Some preliminary statistics for Annotated English. 



Annotations/Diacritics per Letter 


Value 


Annotated English 


1/12 


Czech 


1/9 


Polish 


1/12 


Turkish 


1/15 


Swedish 


1/20 


Danish 


1/25 


French 


1/30 


Spauisli 


1/40 



Table 20. Some preliminary statistics for Annotated English compared to the diacrit- 
ical systems used in other languages. 



Another criticism against '"Annotated English"' is that if we get used to it, 
then we will be unable to read in plain English. First of all, if we get used and 
feel comfortable with something, that means that it is useful, which is the main 
goal of this proposal. According to current computer technology, it should not 
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be a problem to annotate documents on demand. In any case, if one thinks that 
some people get a strong habit to ''Annotated English" that it seems impossible 
to deal with plain English language again, we can take a look at the experience 
with other languages. For instance, in Spanish, a text without (or with wrong) 
diacritics can irritate many, but it is still well read and understood by all. 

One of the issues we have not addressed in detail is how to annotate words 
which are pronounced differently in several dialects (some of them very common, 
such as 'from' , 'of, 'real'). As we said in the introduction, the annotation 
for a text must choose one dialect. But it is important to be consistent. The 
entire document must be annotated with the same dialect and it should also be 
consistent with the spelling (e.g. it would be strange to see 'fast tumour' or 'fast 
tumor'). Of course different annotations can be used in a novel or a conversation 
when people with different accents participate. In some cases, an annotation is 
valid for more dialects than a more specific one. For instance, 'tier' is valid for 
both GA and RP, while 'tier' is only valid for RP. 

Typography for ^Annotated English" is of course a matter of taste. There 
are of course many other choices for the symbols. Our criterion has been to use 
frequent symbols that can be easily recognised, and to avoid symbols which can 
be confused with others. The distinction between upper annotations (vowels) and 
bottom annotations (consonants) is also the result of our criterion for clarity. It 
is important to highlight that any annotation covering two or more letters can 
be reduced to an annotation for one letter plus deletions. This makes typography 
much easier, since we can use single letter accents in text processors and editors, 
as well as in environments where we cannot play freely with the symbols as in 
(the text processor we have used to produce this document). 

Finally, even though ''Annotated English'' is not a spelling reform, it could 
be used as a basis for it, or at least for a spelling reform based to reduce the 
number of annotations (e.g. 'arg' 'ar') or the most awkward ones ('Maghreb' 

Miigreb), in such a way that the words that require the rarest annotation 
symbols could be modified first. Nonetheless, we will not pursue this idea further, 
since we are not convinced that a spelling reform is a good thing for the English 
language, as we have mentioned in the introduction. 

The appendices include some ideas on rules we excluded, but also some ideas 
on the use of diacritical accents for homographs, as well as some notes on WT^^. 
Nonetheless we have plenty of work to be done before going back to these issues. 
Let us just enumerate some of the possible future work: 

— Construct an automatic interpreter (a reader) for ''Annotated English'', tak- 
ing an annotated text (using, e.g. a notation such as the commands 
used here or an XML approach) and converting it into an IPA representation. 

— Construct an automatic annotator. This is a more ambitious project, that 
would require dynamic programming techniques (to choose the most efficient 
annotation according to the costs) and also some disambiguation techniques. 
It also requires access to a corpus of IPA pronunciations for thousand of 
words (there are some which are freely accessible on-line). 
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— Analysis of the rules by the reckoning of statistics for each rule (ratio of 
annotations using it or not using it). This task can be boosted if we have 
an annotator. An interesting approach for this analysis could be made by 
using the Minimum Message Length (MML) principle [8] [7] , where an MML 
coding would be used to transmit the pronunciation of an English text given 
its spelling. In other words, if sender and receiver share an English text 
and the receiver does not have any notion about English pronunciation, 
how can we devise a code such that wc minimise; the; message including 
the pronunciation information? MML coding does not care about intuitive 
simplicity (only information efficiency) and it is very sensitive to the size of 
the documents (for infinite documents, everything would be embedded into 
rules) but it is an interesting idea as a principle. 

— Incorporate ''Annotated English'' in an English for foreigners programme, or 
in a pilot programme using material following the annotations. 

— Incorporate '^Annotated English^ in a spelling learning programme (e.g. for 
children), or in a pilot programme using material following the annotations. 

Only after these tasks we will be able to evaluate the real usefulness and impact 
of this proposal. 
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A Appendix. Alternative Rules and Additional Symbols 

In this appendix we discuss some rules that could be included or excluded. We 
also comment on the reasons why we finally took the decision of excluding or 
including them, respectively. This appendix is only recommended for people who 
wants to find explanations for some of the choices or wants to open a discussion 
on some of the rules. 

A.l The n Most Common Words 

A very effective way of reducing annotations is to insert the n most common 
words into the pronunciation (interpretation) rules. We arc against this option 
in principle, since the good thing about having annotations on common words 
is that they help learn the pronunciation of other, less frequent, words with the 
same annotation. 

For instance, the five most common words according to [5] are: (the, of, to, 
and, a). If we eliminate annotations for the five of them, we have significant 
reduction in the annotation ratio, as we discussed in the conclusions. 

Perhaps we can treat differently those words which change from a strong 
from and a weak form. In fact, the first most common words have this property, 
as shown in Table 21. The use of a table such as this as a rule (keeping these 
five words without annotation) would reduce the ratio of annotations from 1/12 
letters to 1/13 letters (approx). 



Word 


Strong 


Weak 


the 


the ([3i:]) (before vowel) 


(unstressed) the ([3a]) 


of 


of (M), of ([av]). 


(unstressed) of ([av]), (unstressed) of / of([af]/[a]) 
(before voiceless consonant) 


to 


to ([tu:]), to ([tu]) 


(unstrossod) to ([to]), to ([tu]). (unstr(-ss{>d) to 

(["D 


and 


and ([aend]) 


(unstressed) and ([and]), (unstressed) and ([an]), 
and ([n]) 


a 


a (N) 


(unstressed) a ([a]) 



Table 21. Strong and weak forms for five most common words 



A.2 The ed/es Rule 

The cd/es rule could be removed, simplified or extended. For instance, it could 
be extended to cover the regular case where 'ed' and 'es' are pronounced [id] and 
[iz] , which is in the rules of past participle formation and plural formation. For 
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instance, 'beated' and 'bridges' would be left unannotated. We have not included 
this (even it is well-known by speakers) because we want to stress the distinction 
between 'loaves' and 'bridges', and 'beated' and 'stuffed'. 

Another issue could be to convert 'ed' into 't' when preceded by a voiceless 
consonant group (to avoid the annotation in ■stufFcid'). This would be consistent 
with what we do with the 's', where we do not need annotations for 'loaves' and 
'rats' even though the 's' are voiced and voiceless respectively. 

We consider the combination we present here more consistent, but we under- 
stand that this is of course open for further discussion and analysis. 

A. 3 The Rules for Natural and Plain Positions 

Given the rules for natural and plain positions, it is easy to realise that many 
more natural positions require a plain annotation than otherwise. In fact, the 
vowels in natural position in technical or new words are rarely pronounced with 
the natural sound. 

An interesting study would be to simplify the rule and consider all positions 
plain, except when stressed at the end of a segment. For instance, words such 
as 'be' and 'go' would be considered natural positions, but 'plane', 'code', etc., 
would be considered plain positions, so requiring an annotation. Although this 
looks contrary to the traditional English spelling and pronunciation rules, it is 
competitive in terms of annotations. For instance. Table 22 shows a text with 
the annotation system we have presented in this document on the left, and it 
shows the same text with the modification of the natural position rule on the 
right. 

While in the previous text the affected annotations are 11 against and 5 in 
favour, in other kinds of texts, the difference may be more balanced. A more 
complete study of frequencies should be done. If the statistical study gives a 
more balanced situation there would be reasons to modify the rule (the new rule 
is easier and an annotation symbol could be removed) and reasons to keep it as 
it is (the rules for natural and plain position match the implicit or explicit rules 
we use about the English language). 

A possible compromise would be to reconsider the situation of the groups #1 
and #r as single consonants. It is sometimes difficult to realise that words such 
as 'publicize' and 'Declaration of require annotations to make the plain sound. 

A. 4 The Pronunciation of 's' after a Vowel 

The rule for 's' understands this letter as one which oscillates between [s] and [z]. 
For the [z] sound we have 'z', and for the [s] sound we have 'ce', 'ci', 'ss'. So 's' is 
considered a transition consonant having both sounds depending on the context, 
including other words nearby (such as 'This is bad' or 'This is terrible'). Our 
rules try to accommodate the voiced sound between vowels in common words 
('these', 'easy', 'cause', 'music', etc) and also at the end of the word following a 
vowel or a voiced consonant ('as', 'is', and many plurals). It is also common in 
combinations 's' + voiced group ('Oslo', 'atheism', etc.). 



55 



Annotated text with the rules used 
in this document 


Annotated text with the same rules 
but the natural position rule 


It was the best of times, it was the worst 
of times, it was the age of wisdom, it was 
the age of foolishness, it was the epoch 
of belief, it was the epoch of incredulity, 
it was the season of Light, it was the 
season of Darkness, it was the spring of 
hope, it was the winter of despair, we 
had everything before us, we had nothing 
before us, we werg all gomg direct to 
Heaven, we werg all going direct the other 
way - in short, the period was so far hke 
the present period, that some of its noisiest 
authorities insisted on its being received, 
foi' good or for CYi\. in tli<> superlative 
degree of eoiiip;iris(jii only. 


It was the best of times, it was the worst 
of times, it was the age of wisdom, it was 
the age of foolishness, it was the epoch 
of belief, it was the epoch of incredrility, 
it was the season of Light, it was the 
season of Darkness, it was the spring of 
hope, it was the winter of despair, we 
had evgrything before us, we had nothing 
before us, we were all going direct to 
Heaven, we were all going direct the other 
way - in short, the period was so fax Eke 
the present period, that some of its noisiest 
authorities insisted on its being received, 
for good or for evil, in the superlative 
degrei' of ctjiuparisoii only. 



Table 22. Difference between an annotated text with the rules explained in this doc- 
ument (left) and with a modification on the natural position rule (right) 



However, these rules have many exceptions. For instance, it is typically voice- 
less after a vowel in these endings (-ous, -itis, -isis, -us, as well as very common 
words such as 'this' and 'us'), many common words make it voiceless between 
vowels ('house', 'cease', 'basis') and most technical words ('parasite'). 

A further sophistication would be to make it voiced between vowels (or at the 
end of the segment after a vowel) when the vowel before the 's' has the stress. So 
it would work well for 'these', 'easy', 'cause', 'music', but also for endings such 
as '-ous', '-itis', partially '-isis', -us. But we would require to fix the plurals (and 
they arc not always after e, such as lamas, minis, gurus). 

Summing up, we are closer to consider 's' voiceless for every situation than 
making the rule more complex. 

A. 5 Annotating 'r' or the Vowel to Distinguish Rhotic from 
Non-Rhotic 

We have decided to annotate the 'r' to make it non-rhotic using the symbol r. 
Another option we considered at an early stage was to make it non-rhotic when 
the vowel was annotated. For instance, 'lorry' would become 'lorry', and 'syrup' 
would become 'syrup'. However, there are problems to distinguish between firing 
and pirate (the first is rhotic and the second is not). Annotating 'pirate' and not 
'firing' would be an obscure way to transmit the idea. So we finally decided 
to consider the r. Another issue is to consider it equivalent to a double 'rr', 
which forces us to write 'pirate', but avoids the y in 'syrup', which is, by far, 
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more common. So we finally saw that it was clear and economic in terms of 
annotations. 

A. 6 Use of Introduction and Replacement Symbols for 'y' and 'w' 

A possibility is to eliminate the annotations " and 

The first can be easily eliminated since we have already introduced the an- 
notations for 'cellular' and 'failure', instead of 'cellular' and 'failure'. We could 

Y Y 

do similarly with the following: million — > million. The reason why we have not 

implemented this is because we already have the symbol and the sound (e.g. 
tortilla), so it does not solve anything (it is merely an aesthetic thing). How- 
ever, the same reason could be used to eliminate the diphthong annotation for 
'cellular' and 'failure'. 

For 'J, it is more difficult to eliminate this annotation completely, because we 
have words such as penguin and one. The sound of ^one is closer to the sound 
of foie, but not identical. This suggests a possible switch, and to annotate one 
(6 = [wa]) and fgie (i = [wa:]). Nonetheless, we think these combinations would 
be more difficult to learn and are less intuitive ('one' is simpler than Jpne', 
but it hides that we have the same sound as in 'money', and also the strange 
consonantic sound 'w'. 

A. 7 One (or no) Diphthong Symbol 

We have two special symbols for diphthongs: ' and ' ''. We initially considered 
none of them, and used ' " ' and ' " ' to cover the exception sounds, but it was too 
confusing to have 'love' and 'now'. 

Our first idea was to make combinations, such as in 'backgroiind', 'now', 
'tail', 'Laos', etc. But this is not accurate in phonologic terms. So we considered 
the introduction of a new symbol for diphthongs, as follows: 
Wide Diphthong. Single letter (\dip{a}): a. Double letter (\ddip{ae}): ae. Be- 
fore i, it goes like this: (\dip{\i}):T. 

Examples: background (better annotated as background), now, tmi, Laos, 
foie, Freud, frSi, ^e, Dalai. 

Table 23 shows how it works. 

However, this was against the rule that the annotation for a digraph should 
be equivalent to the annotation of the first vowel + the elimination of the second, 
and we did not have that Pridian = Preydian if we want eye = eye. So we finally 
decided to introduce two diphthong symbols. 

A. 8 The wa-, qua-, wo- Rule 

The sound for 'want', 'was', 'qua', etc., is so common that it may suggest a rule. 
But the variety of sounds (water, wane, whale, wall, want, war, ware, wallet, 
wander) makes it somehow impractical. In any case it should be merged with 
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Unit 


IPA 


rhotic 


Examples 


Comments 


a, ai, ay, ay, ei, ey 


[ai] 


[aiar] 


eye, either, Einstein, Fahrenheit, 
Dalai, mSstro, paella, Hawaii 


Alt: Sye / gyg, gither 
/ gither, Dalai, 
mSestro 


au, aw, ao, ou, ow 


[au] 


[auar] 


now, tail, Laos, northbound (al- 
though 'northbound' is preferred) 


Alt: now, tan, Laos / 
now, tau, Laos 


e, eu, ew 




[aiar]] 


Freudian 


Alt: Freudian / 

Freudian 


0, 01, oy 


[wa:] 


[wa:r] 


fcne, patcng, memoir. Note: 


Alt: foie, choir / foie 



Table 23. Pronunciation of the diphthong annotation 



the qua- rule (quantity, quarrel, but quark, quarter, quake), so considering the 
'wa' combination. 

The rule would be (as a two- vowel unit), such as this: 

— "wa in plain position" [wd] : want, was, what, watch, wad, wallet, wan- 
der, wallaby, warrior, quality, quantity, quality, quarrel, squatter (exception: 
whack, water, wall, wag, wax, waggon) 

— "wa in natural position (or with other vowels)" — > [wei] : wane, whale, ware, 
waist (exception: quadrant) 

— "wa in plain position (rhotic)" [wDir] : war, quarter, warn, warm. 

This could also include the wo- rules, which typical sounds opaque: woman, 
womb, wolf, wound, would, ... , an in rhotic position with word, work, worse, 
worm, with many exceptions as well (women, sword, worry, won, wonder) 

We discarded this because we wanted to make the natural and plain positions 
limited for the natural and plain sounds. Otherwise, it would have been much 
more difficult. 

A.9 The -al- / -all, -ol / -oil Rule 

The sounds for 'a' and 'o' before T arc; typically affected in a systematic way. 
Again, a variety of sounds makes it impractical: all, wall, call, walk, talk, wallet, 
alter, ally, algae, shall. In any case, the rule could also include the -ol- / -oil rule 
(control, roll, cold, bold, fold, sold, gold, told, poll) which are frequently natural 
and not plain), but (doll, follow, holly, alcohol, ...). 

A. 10 The -ing- Rule 

It would be useful in many cases to consider -ing- a block, to avoid separations 
in 'going', 'bemg', 'ageing', etc., which do not take place for 'skiing', 'doing'. 
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But we would have some exceptions: Laing, boing (both with a diphthong 
before then ng sound), vainglorious (vain, glorious), and we would have problems 
with 'fatiguing' (fatigu, ing), or 'bilingual' (bil,ing,ual), depending on the order 
of the rules. Perhaps, we would also have problems with 'ingenious' and similar 
words. It would also create confusion with 'point', 'joint'. Leaving it explicit as 
we have done helps to avoid a typical confusion for foreign people to read 'going' 
as 'going'. 

Furthermore, another reason to dismiss this rule is that we did not like to 
consider trigraphs in our system. 

A. 11 The gu- Rule 

An alternative would be to consider the 'u' (after 'g') silent in any case. It would 
go well for 'gyest', 'gyilt', 'vogue', 'guard' and 'gyarantee'. Another option would 
be to consider the u as w for every case. It would help for 'penguin', 'distinguish', 
etc. 

Given the frequencies, the similarity with the -ce, -ci and -ge, -gi rules, we 
finally decided to distinguish 'gue' and 'gui' from the rest. 

A. 12 Stress and Separators 

Initially, there were no rules for stress. The relation between secondary stress 
and primary stress in the way that a secondary stress is assuming two vowel 
units on the left can, of course, be reconsidered. We only consider this on the 
left, but not on the right. The reason is that this happens (almost always) on 
the left, but on right it only happens sometimes (and it depends on the dialect, 
as the word 'territory'). 

The connection with the sounds " and " is certainly innovative, but it 
is almost always true. 

There are other possible rules, such as that the primary stress is always on the 
second vowel unit from the right if the word ends with a 'c', such as 'automatic'. 
But we should consider other suffixes such as 'ical', 'isis', etc. 

Separators are a delicate thing. Separators afi'ecting stress imply many things 
at the same time. They can be handy, but they can also be difiicult to understand. 

A. 13 Reduced Vowels 

Any sound can be unstressed using the appropriate annotation. For a single 
vowel, the most common unstressed sounds are [a] for a, e, o and u, [ou] for o, [i] 
for e (e) and i, and [u]/[ju]/[ja] for u. Many of these sounds can be annotated. 

The problem appears when the same letter may have different sounds de- 
pending on the reader's speed or emphasis. For instance, emission and omission 
can soiind differently for many dialects if pronounced carefully, but they be- 
come indistinct in some dialects if read quickly. There are different situations, 
depending on the speed and the dialect. 
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lentil [i]/[8] 
omission [ou]/[8] 
beautiM [u]/[8] 
cellular [ju]/[j9] 
emission [i]/[a] 
ammunition 
failure [p] 
figure [j9]/[a] 
Mercury [ju]/[j3] 
Chocolate [i]/[a] 
tenure [ja]/[ju8r] 
telephone [i]/[8] 
futile [ai]/[9] 
fortune [a] 

In some cases, this is related with the word being rhotic or not, and also to the 
cases in section A. 14. At the bottom of table 3, we can see that even the IPA has 
some symbols for these situations ([ja], [e], [u], [i]), because they do not only 
oscillate but they can even be a different phoneme. 

A possibility would be to include symbols that could account for these vari- 
ants, so annotating a word in such a way that it would be valid for different 
pronunciations of the word and even for different dialects. A proposal we con- 
sidered is: 

Reduced. Single letter (\red{a}): a. Double letter (\rred{ae}): ae is not used. 

Examples: emission, omission, curriculum, ammunition 

However, this is somehow against the philosophy of producing one pronun- 
ciation given an annotated word. Consequently, in cases where the difference is 
caused by speed, we choose the pronunciation when the word is read slowly and 
carefully. In cases where the difference is originated by dialect, we have to choose 
one of them. 

A. 14 Unstressed di-gi/ji-ci/sci/ssi-si-sti-ti-xi-zi Followed by Vowel 
and du-ju-ssu-su-tu-xu-zu 

There is an implicitly (or explicitly) well-known rule in English about the pro- 
nunciation of words such as passion, vision, nation, pressure, leisure, nature, 
etc. We have of course analysed a possible rule form them. Table 24 summarise 
the possibilities. Table 24 is restricted to unstressed combinations. For stressed 
cases, the sound is not altered, as in science, June (but sure and sugar). But in 
some dialects, dune sounds [ds], tune sounds ([tj]. 

In the table we also show that some cases would be excluded by previous 
rules (denoted by *), and some cases have several pronunciations (denoted by 
+)• 

We also have cases where -ge-, -ce-, -te- sounds -ci-, -ti-, and we have positive 

cases such as dungeon, surgeon, ocean, crustacean, violaceous, but also negative 
cases such ossean, Piscean, Odysseus, and other cases just because -ea- is taken 
as a cluster. 
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Combination 


IPA 


Examples 


Exceptions 


Excluded 


di + a/e/o 




soldier 


media, insidious, 

v^i H 1 ft 1 TTipri 1 


sodium 


gi/ji + a/e/o 




religion, Belgian, 
contagious 


vestigial, collegiate 


orgies* 


ci/sci/ssi + a/e/o 




conscience, passion, 
precious 




calcium, 

cesium, 

fancies* 


/^/^Tl C/^"K1 OItI" I C?l 1 

L-UllbUlicLlHj d1 

a/e/o 




tclioiOli , cXC Ul olUIi , 

pension 




v^cibiubn^ , 

pansies* 


vuwci oi cL/tJ/u 


L38J 


riolclll, tJUtliailcLoicL, 

vision, illusion 


CUalcb I , diilliicoian^ , 

Cartesian+ 


gillindoi Ulll 


oil cL/ e/ u 


Fst Tal 

[stjaj 








ti + a/e/o 




nation, inertia, cau- 
tious, substantial, 

patience, initial 


Equation ([39]), 
initiate (the ver 

ic 1 1 Tils Qn/H ^"r>o 
Id LJ J a-iJ-U tile 

noun is [Jii], but 

i Lr lo lie V CI M 1 J , 

prettiest, mighties 


Pentium, 
beauties* 


Yl —1— / / n 




ion, anxious 


CIA.IUIII, ClllvJl vJjVlCt, 

asphyxiate 




71 —1— Si / / n 


kal 




l\. LJJYilcXZjldll, 

trapezial 




du + 

d / c / O / 1 ^ / 1 OiUlc/ 1 ilig 


[d38]/[d3U8] 


procedure, graduate, 

rrT*Q/H n q'Hi^ti 


arduous+, proce- 

HnVEil— 1— ITT'QHnQl— 1— 

verdure+ 


residuum 


ill 4- 

a/e/o/r@ /rable /ring 




TmnT*o 'r\OT'mT'o 
lllJUlc, ptJlJlilt; 






ssu + 
a/e/o/r@ /rable /ring 


[d3a]/[d3ua] 


tissuc+, issue-h, pres- 
sure 






consonant + su + 
a/e/o/r@ /rable /ring 




censure 


sensual, commen- 
surate 




vowel -|- su -t- 
a/e/o/r@ /rable /ring 


[38]/[3"8] 


casual, usual, mea- 
surable, measuring. 






stu + 
a/e/o/r@/rable/ring 


[Stj8]/[stju8] 


gesture 




perturbation 


tu + 
a/e/o/r@ /rable /ring 


m/[tsu9] 


nature, century, 
mutual, cultural, 
Portuguese, even- 
tual+, tortuous, 
saturation+, satu- 
rated+, spatula+ 


fatuous+, situa- 
tion+ 




xu + 
a/e/o/r@/rable/ring 


m/[kSu9] 


luxury 


sexual-l- 




zu + 
a/e/o/r@ /rable /ring 


[38]/[3«8] 


seizure 







Table 24. Unstressed di/gi-ji/ci-sci-ssi/si/sti/ti/xi/zi and du/ju/ssu/su/tu/xu/zu 
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Apart from all these exceptions and special cases (and the length of the 
table to describe all the combinations), wc would need annotations to say that 
the effect does not take place. This means that we would require annotations for 
the normal t, for the normal d, for the normal x, for the normal z. This is the 
main reason why wc did not adoptcid this. Because it could create cases where we 
would not be able to find an annotation to undo the rule (perhaps the separation 
symbols, but this would be very awkward, like in 'gimnasium' or mightiest. 

Related to that we have to mention the endings -lure, -nion, -nious, -lia, -nia, 
etc. In some cases they require a sound [j] (as in failure, million and onion), and 
in other cases they do not. 

A. 15 Diacritic Use of Annotations for Homographs (Homophone 
and Not-homophone) 

There is a possibility to annotate a word which would not require an annotation 
in order to distinguish it from its homograph. In the cases where the homographs 
are not homophone, it is clear that both words will have a different annotation 
(such as wind and wind, or wound and wound). In the first case, we can get 
confused if we get used to annotated text and we eventually read non-annotated 
text. Any appearance of the word 'wind' will be assimilated with the first variant, 
even though in non-annotated English it could be both of them. We think this 
is slight problem since the reader must realise the context. The problem can be 
more serious if we just read some words (a slogan, a sign, etc.). If a few words 
have no annotations, it might be an annotated text, but this possibility is quite 
unlikely. So we will not care about this. 

Another different situation is when two different words are homographs and 
homophones. In this case, the annotation would be the same. For instance, we 
have 'run' for the infinitive and for the past participle. Since none of them have 
an annotation, we could write 'nin' for the participle to distinguish it from the 
infinitive. This technique is used in other languages, such as Spanish, where it 
is called a 'diacritical accent'. It is an interesting idea, but we will not develop 
it at this moment of the proposal. 

B Appendix. Some MT^jX Stuff. Packages and Symbol 
Construction 

This appendix is intended to people who wants to use WF^. As it is possible 
that some annotators would produce text which could be compiled into 

PDFs using WT^, this may be useful to people who are not regular users of 

The good thing about using or any other command-based or mark- 

up language is that wc can define commands or labels for several annotations 
and then modify their graphical representation. For instance the annotated word 
'showroom' is coded in X^T^jX as follows: 
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show\se{}r\oopq{oo}m 

This can be easily converted into XML as: 

show<se/>r<oopq>oo<oopq/>in 

And represented graphically with a proper style sheet. 

B.l Packages used and new symbols we created 

This section describes the packages we need in WT^. Here they are: 

\usepackage{amssjnnb}- many mathematical symbols 

\usepackage{amsmath} °A many mathematical symbols 

\usepackage{yhmath} °A \widetriangle 
\usepackage{tipa} many 

\usepackage{phonetic} \textipa and related symbols 
\usepackage{extraipa} \subdoublevert 

\usepackage{mathabx} % \widecheck 

\usepackage{stmaryrd} % \Ydown 

\usepackage{harpoon} % \overlef tharpdown, \overlef tharp 
\usepackage{mathtools} \overbracket 

\usepackage-Caccents} °/, fabrication of new accents (\accentset, \underaccent) 



When using multiple packages we may encounter incompatibilities because a 
package tries to redefine a command, which has been already defined. For in- 
stance, when using the LNCS documentclass, we may get a warning when loading 
package 'amsmath': 



Unable to redefine math accent \vec 



This can trigger further errors on other packages, such as in accents. sty, with 
something like "Argument of \vec has and extra " , because the right redefinition 
did not take place. One way of solving this is surrounding the 'documentclass' 
declaration as follows: 



\let\accentvec\vec 
\documentclass [f leqn] {lines} 

\let\spvec\vec 
\let\vec\accentvec 



B.2 Manual Installation of Packages 

Package "extraipa" is not standard, so it has to be installed manually. The 
mathabx fonts can be found here: 

http : //www . ctan . org/tex-archive/f onts/mathabx/ 
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We have to downloaded mathabx.zip from there and unzip it. Inside that docu- 
ment there are: 

~ 54 files ill a folder named 'source'; all 54 of those files had the extension .mf; 

— 4 files in a folder named 'texinputs': mathabx.dcl, mathabx.sty, mathabx.tex, 
testmac.tex 

— Other 4 files outside these folders. They are examples. You can ignore them. 

Then, the installation depends on the X^T^X system you arc using. Here I will 
describe that for MIKTEX for Windows, but it is similar for other distributions. 

Go to where your miktex distribution is located, and then look for the fonts' 
source folder. Typically something like this: 

c:\Prograin Files\MikTeX 2 . 7\f onts\source\public 

Create a new folder called 'mathabx' and put all 54 of the .mf files in there. 

Then go to to the folder 

c:\ProgrELm Files\MikTeX 2 . 7\tex\generic\misc 

Create a new folder called 'mathabx' and put the other four files (mathabx.dcl, 
mathabx.sty, mathabx.tex, testmac.tex) in there. 

Then, on the Windows' 'start' go to Miktex2.7 Settings. On the window 
that comes up, click the buttons "Refresh FNDB" and then "Update Formats" . 

That's it. You should compile fine with this. 

B.3 Creating Commands 

The basic commands in text mode are created as follows. For instance, the 
separator is just created as: 

\newcommand-C\se} [1] -C\textraising{#l}} 

We can join two symbols into one, just putting them together. For instance, the 
separator is created as follows: 

\newcommand-[\selr} [1] {\sel{}\textsubplus{>} 

Some accents needed to use the math environment to cover two letters. For 
instance, this is how we constructed a tilde symbol covering two letters: 

\newcommand{\nnat} [1] {$\widetilde-C\inbox-[#l}}-$} 

For the breve we (in the mathabx package) we did similarly. 
\newcommand{\oopq]- [1] {$\widecheck{\mbox{#l}}$} 

For the cross beneath the letter, we managed to create a new symbol called 
\textsubcross from the package tipa. 

\newcommand-C\textsubcross} [1] {\tipaloweraccent{24}{#l}} 

We used it to define the \si command. 

\newcominand{\si} [1] {\textsubcross{#l}} 
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B.4 List of Command Definition 



Copying the package loading instructions and the foUowing command definitions, 
we can compile any annotated English text with notation. 

\newcommand-[\textsubcross} [1] {\tipaloweraccent{24}{#l}} 

7, SHORTCUTS 
°/o Annotated 

\newcominand-C\annl} [1] {\textopencorner{#l}} 7.{$\llcorner{}{\mbox{#l}}$} 
\newconimand{\annr} [1] {\textcorner{#l}} 7.{$\lrcorner{}{\inbox{#l}}$} 

7o Non-einnotated 

\newcommand{\nonl} [1] {$\llcorner{}{\mbox{#l}}$} °/o{\textopencorner{#l}} 
\newcommand{\nonr} [1] {$\lrcorner{}-C\mbox{#l}}$} y,{\textcorner{#l}} 



7, Silent, stress and separators (beneath) 
7, Hint: all start with 's' 
\newcoiiimand{\si} [1] {\textsubcross{#l}} 
\newcommand{\st} [1] {\textsyllabic-C#l}} 
\newconimcLnd{\stst} [1] {\subdoublevert{#l}} 
\newcommand{\se} [1] {\textraising{#l}} 
\newcommaiid{\sel}- [1] {\textadvancing{#l}-3- 
\newconimcind{\ser} [1] {\textretracting{#l}} 
\newcommand{\selr} [1] {\sel{}\textsubplus{}} 
\newcommand{\serl} [1] ■[\textsubplus-C}\ser{» 

7o Consonants (beneath) 
7, Hint: all end with 'o' 
\newcommand{\co} [1] {\textsubring-[#l}-} 
\newcommand{\io} [1] {\d-[#l}} 
\newconunand{\vo> [1] {\textsubwedge{#l}} 
\newcominand{\no> [1] -[\textsubcircum{#l}} 

\newcominand{\svo} [1] -[\textinvsubbridge{#l}> 
\newcominand{\sno} [1] -[\textsubbridge{#l}} 

\newcommand-[\hvo} [1] -[\b{\textinvsubbridge{#l}}} 
\newcommand-[\hno} [1] -[\b{\textsubbridge{#l}>} 
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7, Semiconsonant (w) 
\newcommand{\w} [1] {\textsubw{#l}} 
°/o Semiconsonant (y) 

\newcomnicind{\y} [1] {$\underaccent{\Ydown>{\mbox{#l}-}$} 
°/o Vowel Introductions 

\newcommand{\sch} [1] {$\accentset{\backepsilon}{\mbox{#l\upbar{}}}$} % package textcomp 
\newcommand{\ssch} [1] {$\accentset{\backepsilon}{\mbox-[#l}}$} % package textcomp 

°/o Vowel Annotations 
°/o Hint : all with three letters 
\newcommand{\pln} [1] {\textvbaraccent{#l}} 
\newcommand{\ppln} [1] {\textvbaraccent{#l» 

\newcommand{\nat} [1] {\~{#1» 

\newcommand{\nnat} [1] {$\widetilde-[\mbox{#l}}$} 

\newcommand{\brd} [1] {\={#1}} 
\newcommand{\bbrd} [1] {$\overline{\mbox{#l»$} 

\newcommand{\rnd} [1] {\r{#l}} 
\newcommand{\rrnd} [1] {\r-[#l}} 

\newcommand{\clr} [1] 

\newcommand{\cclr} [1] {$\widehat{\mbox{#l»$} 
\newcommand{\opq} [1] {\v{#l}} 

\newcommand{\oopq} [1] {$\widecheck{\mbox{#l}}$} 

\newcommand{\dip} [1] {$\widetriangle-[\mbox{#l}}$} % Problems 
\newcommand{\ddip} [1] {$\widetriangle{\mbox{#l»$} 

\newcommand{\idp} [1] {\ ' {#1}} 

\newcommand{\iidp} [1] {$\overlef tharpdown{\mbox{#l}}$} 
\newcommand{\udp} [1] {\ ' {#1}} 

\newcommand{\uudp} [1] {$\overlef tharp{\mbox{#l»$} 

\newcommand-C\iot} [1] -[\ . {#!}} 

\newcommand-C\cnt} [1] -CXn*!}} 
\newcommand-C\ccnt} [1] {\"{#1}} 
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\newcommand{\red} [1] {\crtilde{#l}> 
\newcominand{\rred} [1] {\crtilde{#l}} 

"/o Grouping Command 

\newcommand-[\group} [1] {\iinderline{#l}} 
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